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Abstract 

Legumes, and a very few non-legume plant species, arc known to possess functioning haemoglobin genes. 
We describe here the characterization of a haemoglobin cDNA isolated from barley. The deduced amino 
acid sequence shows 71 % amino acid identity with a non-legume haemoglobin gene, a further 16% of 
the residues being conservative replacements. The barley cDNA also hybridizes to genomic sequences 
in rye, maize and wheat. The demonstration of a gene from a monocotyledon with close sequence 
homology to the known non-legume plant haemoglobins fills a major gap in the known distribution of 
haemoglobin genes in the plant kingdom. The expression of the gene is induced in isolated barley aleurone 
layers exposed to anaerobic conditions, and the roots of flooding-stressed barley plants. The expression 
of the RNA under anoxic conditions is similar to thai of a known anaerobic response gene, alcohol 
dehydrogenase. Our results suggest that the increased expression of haemoglobin RNA is an integral part 
of the normal anaerobic response in barley. The findings are discussed in the light of current theories 
of haemoglobin function and evolution. 



Introduction 

Plant haemoglobins (Hb) in the form of leghae- 
moglobins (Lb) have, for many years, been known 
to exist in the root nodules of legumes [27, 30]. 
Haemoglobins have more recently been positively 
identified in two closely related members pf the 
Ulmaceae Parasponia andersonii (nodulating) and 
Trema tomentosa (non-nodulating), and in one 
species Casuarina glauca of the Casuarinaceae 
[8> 29]. There is considerable support [$ T 10, 14, 



31 ] for the hypothesis that plant and animal Hbs 
have a common evolutionary origin. However, , 
the inability to demonstrate the presence of an Hb 
gene in monocotyledons has left open the possi- 
bility of inter-kingdom horizontal transfer to the 
dicots [8, 31].. There have been no reports which 
unequivocally demonstrate the presence of an Hb 
gene in a monocotyledon [3, 6], although claims 
of Lb-like sequences in the genomes of a number 
of non-legumes have appeared [36, 37]. These, 
however, were based only on Southern blotting 



The nucleotide sequence data reported will appear in the GenBank Nucleotide Sequence Database under the accession number 
U01228. 
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experiments and the results have been questioned 
[3]. Several authors [8, 31] have noted the limi- 
tations which have frustrated attempts at cross 
species screening using available antibody and 
DNA probes. 

The role of Lb in legume synibioses has been 
well-documented [2J. It is believed to act as an 
oxygen carrier in the symbiosomes of root nod- 
ules, supplying oxygen to the bacterial respiratory 
chain whilst preserving a low free-oxygen concen- 
tration [2]. The sequences of other plant Hb pro- 
teins are quite distinct from those of the Lb type 
and have been reviewed in some detail by 
Arredondo-Peter and Escamilla [6). Current 
knowledge of the properties of non-legume plant 
Hb is based largely upon studies of the Parasponia 
protein. This protein has a very high oxygen 
affinity (Km 89 nM), and shows rapid oxygen 
rebinding (geminate reaction) (17,43]. Trans- 
genic studies have shown that the Parasponia Hb 
promoter is activated in the root meristem and 
vascular bundle in tobacco, and additionally ex- 
pressed in root nodules of Lotus [9, 31]. 

The main body of work on the response of 
plants to anaerobiosis has dealt with glycolytic 
metabolism. Seed and root tissues that are toler- 
ant of hypoxic conditions respond to oxygen defi- 
cit by increased utilization of glycolytic pathways 
[28]. In barley aleurone tissue, for example, low 
oxygen stress results in increased expression of 
alcohol dehydrogenase and lactate dehydroge- 
nase RNA and protein [20, 22, 26]. Whilst 
changes in glycolysis are clearly major factors in 
plant adaptation to hypoxia, there are probably 
other factors involved. U is likely that under 
otherwise anoxic conditions roots can receive 
some oxygen input by axial movement from the 
aerial parts of the plant [4, 5, 1 1 ]. It has also been 
suggested that facilitated diffusion of oxygen and 
the accompanying buffering of oxygen tensions 
may be significant factors for cellular metabolism 
[40]. 

We have isolated a barley cDNA clone which 
has strong sequence homology to other non- 
legume Hbs. We have examined the Hb transcript 
levels, under varying exposures to oxygen stress, 
in comparison to the response of alcohol and 



lactate dehyrodegenase, two glycolytic enzymes 
that have been studied extensively in the normal 
anaerobic response of aleurone tissues. We also 
examined the behaviour of the gene in root tissues 
under flooding (hypoxic) conditions. On the basis 
of our results, we suggest that all monocots, and 
probably all plants, may in fact possess an Hb 
gene. Our evidence also indicates that the expres- 
sion of the barley Hb gene is highly induced in 
response Zo anaerobic stress, and that this prop- 
erty of the gene is not limited to barley. The 
implications of these findings are discussed. 

Materials and methods 

Materials 

Wheat (cv. Biggar), rye (cv. Musketeer) and 
maize were available locally. Barley seeds (cv. 
Harrington) were provided by the Canadian 
Grain Commission, Winnipeg, Manitoba. The 
Adh 1 probe was a gift from £. S, Dennis, and 
the barley Ldh probe was a gift from A, D. 
Hanson. 

<* 

Library screening and sequence analysis 

Clones were isolated from a Agt22 cDNA library 
of RNA from aleurone layers of Harrington bar- 
ley half seeds, incubated with 1 gibberellic 
acid for 5 days [11,34], The initial Hb cDNA 
was isolated serendipitously in an immunological 
screen for other proteins. This cDNA was then 
used as a probe to identify full-length clones. The 
library was screened with the radiolabeled Hb 
cDNA probe according to Maniatis etaL [32]. 
cDN A inserts were subcloned into bluescript SK 
plasmid, and sequenced using a Gibco/BRL cycle 
sequencing kit. Sequence analysis and manipula- 
tion were carried out using the FASTA and FSAP 
packages and other software [15, 16, 35], 

Southern bhtttog 

Genomic DNA samples were prepared from 
seedling leaves by a modification of the CTAB 
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method 138] and aliquots of 10 or 20 Mg DNA 
were digested with appropriate restriction en- 
zymes before size fractionation on agarose gels in 
Tris- acetate buffer. DNA was blotted onto Hy- 
bond N + (Amersham) membrane in 20 x SSC 
overnight and fixed by alkali and UV treatment. 
Hybridisations were carried out overnight in a 
mixture containing 6 x SSC, 5 x Denhardt's so- 
lution, 1% SDS, 100 jig/jd herring testis DNA, at 
65 *C Washes were carried out at 65 °C in 0.5 % 
SDS plus varying concentrations of SSC. Washes 
were initially performed in 2 X SSC, then in i x 
SSC, 0.5 x SSC, or finally 0.2 x SSC as appro- 
priate. Filters were exposed to Kodak XAR-5 
film with screens at -75 6 C. Fragment sizes 
were estimated relative to Gibco/BRL size stan- 
dards. 



Northern blotting 

Total RNA was prepared from frozen seedling 
leaves, seedling roots, coleoptiles of 10 day dark 
germinated seeds, and aleurone layers (usually 
batches of 75 aleurone layers per sample) accord- 
ing to the procedure described by Mohapatra 
etal (33]. RNA was quantitated by UV absorp- 
tion spectroscopy [32] and size-fractionated on 
agarose/formaldehyde gels [32]. Loading equiva- 
lence and RNA integrity were confirmed by 
ethidium staining of gels prior to transfer, RNA 
was blotted to GeneScreen (NEN) or Hybond 
N + (Amersham) membranes according to the 
manufacturer's instructions, and fixed with UV, 
Hybridisation, washing, and exposure to film, 
were carried out as described for southern blot- 
ting. Transcript sizes were estimated relative to 
Gibco/BRL RNA size standards, which were vi- 
sualised by methylene blue staining [32]. Blots 
were re-probed in sequence with Hb probe, Ldh 
probe and Adh probe. 



Labelling was performed using 32 P-dCTP (ICN), 
unincorporated label was removed by spinning 
through Sephadex G50 (Pharmacia), Probes were 
added at concentrations of 1-5 x 10* cpm per ml 
of hybridisation fluid. Higher probe concentra- 
tions were used for cross-species Southern blots 
where signal strengths were extremely low. 



Preparation of aleurone layers 

Aleurone layers from barley (Hordeum vidgare cv. 
Harrington) were isolated by normal procedures 
[11,34]. Briefly, embryo-free barley half seeds 
were sterilised in 0.25% bleach for 30 min, rinsed 
four times for 5 to 10 min in sterile water. After 
3 days imbibition at 22 °C, aleurone layers were 
separated. Twenty or 25 layers were placed in 
sterile 50 ml conical flask containing 3 ml H 2 0, 
1 GA, 10 *ig/ml chloramphenicol, 10 mM 
CaCl 2> and incubated on a rotary shaker running , 
at 65 cycles/minute, at ca* 24 °C 



Oxygen stress treatments 

Isolated aleurone layers were . incubated under 
controlled oxygen tensions essentially as de- 
scribed by Hanson and Jacobsen [22]. For time 
course incubation under N 2 , each flask was 
flushed with high purity N 2 for 2 min, then flaslj;? 
were placed in 3 litre jars, which were in turn 
purged with N 2 for 1 h. The jars were reflushed 
with N 2 for 30 min every 6 h thereafter* For var- 
ied oxygen tensions, aleurone flasks were placed 
in 1-litre jars; these were sealed and flushed with 
the appropriate mixture of 0 2 and N 2 gases for 
15 min. Jars were re-purged with the same gas 
mixture every 3 to 6 h. Samples were harvested at 
appropriate time points, frozen in liquid nitrogen, 
and stored at -75 °C 



Preparation and use of probes 

Purified plasmid inserts were radioactively la- 
belled according to standard procedures [32]. 



Flooding experiments 

Batches of 10 barley or maize seeds were planted 
in a mixture of peat, sand, and soil and allowed 
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to germinate normally up to the two leaf stage. 
Individual pots were used for each treatment. At 
the start of the experimental treatment, an appro- 
priate number of pots were placed in a deep tray 
of water, so as to completely submerge the root 
system up to soil level. For sampling, the contents 
of whole pots were removed and washed to clear 
the roots of soil and debris. Root tissue was fro- 
zen in liquid nitrogen and used to prepare total 
RNA as described above. 



Extraction and assay ofADH and LDH 

Batches of 6 aleurone layers were homogenised 
using a blender (Polytroo, Brinkman) at top speed 
(qx 30 s in a 10 ml plastic tube containing 2 ml 
ice-cold isolation buffer (0.5 M Tris-HCl pH 8.0, 
10 mM DTT)« The homogenate was clarified by 
centrifugatton at 10000 rpm (12000 X g) in a Sor- 
vall SS34 rotor for 20 min. The supernatants were 
transferred to 1.5 ml microfuge tubes and respun 
at 10000 rpm for 20 min. The supernatants were 
used for assays. Lactate dehydrogenase (EC 
1.1.1.1) and alcohol dehydrogenase (EC 1.1.1.27) 
assays were performed exactly as described by 
Hanson and Jacobsen [22]. Total protein con- 
centrations were standardised using a Biorad 
protein assay kit with y -globulin as a standard. 
Enzyme activity was expressed in IU (jimo\ of 
NADH oxidised or reduced per minute). Each 
time point represents the results of three separate 
measurements. 



Results 

Cloning and sequencing 

During the course of immunologically screening 
an aleurone cDNA library with antiserum for the 
1,6-a-glucosidase, limit dextrinase, a cDNA was 
isolated which encoded a protein with strong se- 
quence homology to Parasponia andersonii and 
Trema tomentosa Hbs [8, 31]. It is possible that 
this is due to some contaminant in the protein 
used to raise antiserum, but the serum does not 



detect an Hb-sized protein on western blots. The 
initial cDNA isolate represents bases 354 to 706 
of the sequence presented here (Fig. 1). This frag- 
ment was used as a probe to re-screen the library 
yielding a number of clones of varying lengths, the 
sequence of the longest clone is the basis for the 
data presented. Partial sequencing of a number of 
independent clones indicated no rearrangement 
of the cDNA sequence during cloning. It also 
revealed tfo variations in nucleotide sequence 
between different clones* 

The complete open reading frame encodes a 
predicted translation product 162 amino acids 
long (17.8 kDa). The predicted protein sequence 
shows 71 % amino acid identity with Parasponia 
Hb> with a further 16% of residues being conser- 
vative replacements. The sequence is clearly dis- 
tinct from the Lb class of proteins, relative to 
which it shows ca. 40% sequence identity and a 
further 40% of conserved changes. Residues con- 
served between known dicotyledons and' animal 
Hb [6] are also conserved in the barley Hb. These 
residues are underlined in Fig* L The nucleotide 
sequence is likewise highly conserved relative to 
other non-legume Hbs, being 70 % homologous to 
Parasponia. Sequence comparisons of different 
plant Hbs have revealed a consensus for nine 
specific residues amongst those which show rapid 
oxygen rebinding (geminate plus) [ 17], six of these 
are also conserved in the barley and Parasponia 
sequences. Two of the changed positions (Thr-92 
and Gln-149) are identical in the barley and Para- 
sponia sequences (see Fig. 1). The barley protein 
also shares the cysteine residue (residue 79 in the 
barley protein) seen in all the sequenced non 
legume plant Hbs and in none of the sequenced 
Lbs [6] 



The liaemoglobin DNA sequence is present in the 
genomic DNA of barley and other cereals 

Southern blotting (Fig. 2a) confirmed that the 
cDNA sequence was present in the barley ge- 
nome. Hybridising fragments were ca. 5.0 kb with 
Bam HI, 4.9 kb with Bgl II, 7.5 kb with Hind III 
and 4,0 kb with Bam HI and Hind III combined. 
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Fig. I . Nucleotide sequence or a barley Hb cDNA clone with predicted translation product (blyhb), showing similarity to Parusponla 
Hb [8] (parhb). + indicates identical residue, - indicates gap inserted to optimise similarity. Amino acid residues conserved 
between animal and dicot Hbs [6] arc underlined. Positions of the nine residues conserved amongst the geminate plus (fast oxy- 
gen rebinding) Hbs are shaded. Thr-92 and Gln-149 replace Leu and Val, respectively, in the geminate plus consensus (double- • 
underlined) [17]. The conserved cysteine (Cys-79) is also indicated (double-underlined). Nucleotide and protein sequences are 
numbered. 
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Fig . 2 . Southern blots probed with barley Hb cDN A. a. B ariey 
DNA digested with, Sam HI and HM III (B + H), Bam HI 
(BaX Bgl II (Bg), Mind III (H). b. Rye, maize and wheat DNA, 
digested with Bam HI (Ba) or HM III (H). Samples were run 
on a 1 % agarose gel and blotted as described in Materials and 
methods. Positions of size markers are indicated in kb. Un- 
hybridised probe whs removed by washing in 2 x SSC and 
then 0.5 x SSC/0.5%; SDS (rye, maize, wheat) or 0.2 x SSC/ 
0.5%; SDS (barley), at 65 °C Blots were exposed to film at 
-75 "C 



cate the presence of multiple, related sequences. 
None of the blotting results suggest the presence 
of a multigene family as seen with the Lbs. 

The barley probe also hybridised to genomic 
sequences in a number of other cereals (Fig. 2b), 
even at quite high stringency. In rye, Bam HI gaye 
a 4*8 kb hybridising band, and Hind HI, 7.5 kb 
and 5.0 kb bands. In maize, Bam HI gave a frag- 
ment of 4.8 kb and Hin6 III 7.5 kb. Within the 
limits of cross-species probing, it seems that in 
both of these cases, as in barley and Parasponia, 
the gene is present at low copy number. Again, 
even at low stringency no evidence was seen for 
a multigene family as observed amongst the Lbs. 
In hexaploid wheat, there are three hybridising 
fragments with each enzyme, Bam HI giving 
bands at 6.3, 2.4 and L8 kb, and Hin6 III giving 
bands of 7.6, 5.0 and 3.8 kb. It is possible that 
each of the three bands represents one chromo- 
some complement. We cannot rule out the pOsS- 
sibility, however, that these three bands represent 
a gene family. 
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The haemoglobin RNA is expressed in roots and 
aleurone layers 

Northern blot analysis of barley RNA indicated 
a transcript of ca. 900 nucleotides, which was 
present in both total and poly(A) + RNA (Fig. 3a 
and b). A transcript was easily detectable in total 
RNA isolated from the roots of seedlings at the 
one- and three-leaf stages. There was no detect- 
able expression of the transcript in leaf tissue of 
the same plants, This is consistent with the ex- 
pression pattern seen in the non-legume dicot 
species which have been examined [3, 8]. 

In addition to its expression in root tissue, the 
Hb transcript can be detected at very low levels 
in coleoptile tissue (Fig. 3 a) and in oxygen- 
stressed aleurone layers (Fig. 3b) of imbibed half 
seeds. 



Effect of 0 2 concentration on the expression of 
haemoglobin, alcohol dehydrogenase and lactate 
dehydrogenase mRNA expression 

Since the expression of Hb message increased in 
response to 0 2 deprivation, a series of experi- 
ments were conducted in which aleurone layers 
were incubated under imposed oxygen concen- 
trations ranging from 20% to 0% 0 2 for a period 
of 24 h. Hb mRNA was not detectable at 20% 



and 10% 0 2 was barely visible at 5% 0 2 and 
became easily detectable at 2% and 0% 0 2 
(Fig, 4a). The same blots used to examine the Hb 
message were re-probed for Adh and Ldh tran- 
scripts. Adh transcript levels increased steadily 
as the 0 2 concentration was decreased from 10% 
to 0% . Ldh message was noticeably elevated be- 
tween 2% and 10% with a lower but easily 
detectable basal level of expression in 0% and 
20% D a . 



Temporal expression of haemoglobin, alcohol dehy- 
drogenase and lactate dehydrogenase RNA in 
aleurone layers under anoxic conditions 

The expression of Hb, Ldh and Adh messages 
was examined in aleurone layers under sustained 
anoxia (Fig, 4b). Increased levels of Hb message 
were first observable after 1 h of incubation under 
N 2 , and continued to increase up to 12 h, the last 
sampling point in this experiment. No significant 
changes in Hb message level were observed be- 
yond 12 h (results not shown). Increases in Adh 
and Ldh transcript levels also became apparent 
after the first hour of anoxia. After the first two 
hours of treatment changes became less pro- 
nounced. High basal levels of Ldh message were 
apparent in the 0 h and 12 h control (incubated in 
normal air) samples. 
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Fig. 3. Northern blots of barley RNA probed with barley Hb 
cDNA. a, 30 ^tg tola! RNA from, colcoplilcs (C), leaves (L), 
and roots (R). Growth stages of 1 leaf and 3 leaf are indicated, 
b, 0.5 jig poly A + RNA from roots of 3-leaf stage seedlings 
(R). 20 pg total RNA from aleurone layers (AJ) at a zero time 
point (T 0 ) and after 24 h in normal atmospheric conditions 
( + ), or oxygen deficient conditions ( - ). Approximate tran- 
script size is shown in kb, Gels were run and blotted as 
described in Materials and methods. Blots were washed in 
0.2 x SSCat65 Q C, 



Changes in enzyme activity of ADH and LDH under 
anoxia and hypoxia 

As a confirmation that the aleurone preparations 
were behaving normally under oxygen stress, ac- 
tivity assays for LDH and ADH enzymes were 
carried out on aleurone layers exposed to a range 
of imposed O a concentrations, and under sus- 
tained anoxia. When aleurones were incubated 
under concentrations ranging from 20% to 
0% (Fig. 4a, top), the results indicated that ADH 
activity remained low until the 0 2 concentration 
dropped below 10%, activity then increased 
steadily up to a maximum in 0% 0 2 LDH activity 
rose as the 0 2 concentration was decreased to 
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Fig. 4. Response of aleurone layers to (a) different oxygen concentrations, and (b) sustained anoxia. Northern blots were probed 
in sequence with Hb probe, Ldh probe, and Adh probe. Each Lane carries 1 5 total RNA, blots were washed at 0.2 x SSC 65 D C. 
In a, numbers represent percentage oxygen, from 0 to 20%. In b, numbers represent rime in hours after beginning the experiment, 
from 0 to 12 hours. Graphs show the corresponding changes in ADH and LDH cruyme Activity observed in similar experiments. 
The standard deviation for each measurement is shown. The time-course of enzyme activities shown in b is over 72 h to allow 
direct comparison with the results of Hanson etaL [22], Gels were run and blotted as described in Materials and methods. Blots 
were washed in 0.2 x SSC at 65 °C. 



10%, remained relatively constant over the range 
of 10% to 2% 0 2 , then declined slightly at 0% 

Under sustained anoxia (Fig. 4b, top) ADH 
activity increased rapidly during the first 12 h, 
and continued to increase up to 48 h. Following 
a slight decline in the first 6 h, LDH activity in- 
creased steadily until 48 h. A comparison of 
anoxic and oxygenated aleurone layers after 72 h 
indicated about an 8-fold increase in ADH ac- 
tivity and less than a 2-fold increase in LDH 
activity. 

These results are very similar to those of 
Hanson and Jacobsen [22]. They confirm that 
the aleurone preparations behaved normally in 
response to low oxygen stress. They place results 



regarding Hb transcription in the context of a 
normal anaerobic response. 



Levels of the haemoglobin transcript are increased 
in flooded roots of barley and maize 

In view of the increase in Hb transcript levels in 
oxygen-stressed aleurone layers it was important 
to know if this finding was also relevant to the 
gene in its major site of expression, root tissue. 
Barley and maize roots were examined for Hb 
RNA expression under flooding stress. Figure 5 
shows RNA samples from the flooded and un- 
flooded plants, probed with the Hb cDNA. After 
a 1 2 h period of flooding stress, both maize and 
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fig. $. Hb transcript expression- in RNA from the roots of 
barley and maize plants. Lanes are (C) control roots, and (F) 
flooded root systems, completely submerged in water for 12 h. 
Each lane carries 1 5 /ig of total RNA. Gel was run arid blotted 
as described in Materials and methods. Blot was washed in 
0.2 x SSCat65°C. 



barley expressed greatly increased levels of Hb 
. ; RNA. In this exposure the Hb transcript is barely 
visible in control roots, but is clearly visible in the 
flooding-stressed samples. 



Discussion 

It is now generally accepted that the known ani- 
mal and dicot plant Hbs evolved from a common 
ancestor about 1400 million years ago [10, 14, 
31]. However, m the absence of a monocot Hb 
gene, the possibility that the dicot gene is a result 
of inlerkingdom horizontal transfer has remained 
troubling [3], The demonstration of a cereal Hb 
gene confirms that it is present in two of the major 
divisions of the plant kingdom. This, in turn, 
makes it all the more likely that an Hb gene is 
indeed present in all plants [3, 6, 14], and perhaps 
serves a common function there [3], The conser- 
vation of kinetically important residues between 
the barley and Parasponia Hbs (Fig. 1) suggests 
that the barley protein is functional and that il 
may share kinetic properties with die Parasponia 
protein. Some authors have suggested that the 
identification of a monocot Hb gene may circum- 
vent one conceptual obstacle in the generation of 
artificial nitrogen-fixing symbioses involving 
monocot plants [3]. 

Based on the available information concerning 
the expression pattern and kinetic properties of 
non-legume plant Hb, two major hypotheses have 



been put forward to account for the expression of 
the protein in non-nodulating plants. One involves 
Hb-facilitated diffusion of oxygen within the root 
tissues [3, 9). Another proposes a role as an oxy- 
gen sensor, detecting changing levels of environ- 
mental oxygen and causing the switch from aero- 
bic to anaerobic metabolism [3, 9]* There is 
already evidence from work on mammalian and 
rhizobial systems [ 18, 19] that at least some oxy- 
gen sensors are haem proteins. However, work 
on the response of E. coli to low oxygen stress 
suggests that the switch mechanism need not rely 
on monitoring oxygen levels at all [39]. Work on 
a range of experimental systems has suggested a 
number of other functions for proteins of the Hb 
family, including reduction of metal ions [7], and 
a role as a terminal oxidase [ 12, 13]- These and 
many other possibilities have been extensively 
reviewed by Wittenberg and Wittenberg [42], 

Whether the major role of plant Hb is as an 
oxygen sensor, or as a facilitator of oxygen diffu- 
sion, the expression of the gene would be ex- 
pected in organs such as roots and seed tissues 
(the alcurone and embryo itself) which might be 
expected to experience oxygen stress • [3, 22]. 
Aside from one unconfirmed report of an Hb pro- 
tein in seed tissues [25], there have been no pre* 
vious reports of Hb expression other than in root 
and root nodule tissues. 

If the Hb protein has some functional role in 
the anaerobic response (as a facilitator of oxygen 
diffusion for example), one would expect it's RNA 
levels to be up-regulated under low oxygen stress. 
That is, it should behave in much the same way 
as other genes involved in the anaerobic response 
[ 1, 22, 24, 41], Experimental analysis showed that 
if barley alcurone layers are subjected to a low 
oxygen regime (Fig. 4X the level of the Hb RNA 
does indeed increase dramatically. The increase 
in transcript levels in response to low oxygen 
levels is apparent at lower 0 2 concentrations for 
Hb than it is for Adh or Ldh. The timing of the 
Hb response to anoxia is similar to that of the 
Adh and Ldh genes, the increase in Hb transcript 
levels seems to be relatively large. Taken together 
with the changes in ADH and LDH enzyme ac- 
tivity, these results indicate that the induction of 
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Hb RNA expression is occurring within the 
framework of a normal anaerobic response [20, 
22, 23]. The Hb RNA induction parallels raChcr 
than anticipates the expression of the Adh and 
Ldh messages. This would not be expected if 
the only function of the protein were as an oxy- 
gen sensor triggering the anaerobic response and 
thus the increased expression of these two tran- 
scripts. 

An examination of the expression of the Hb 
RNA in roots of both barley and maize gives a 
similar picture, the gene is again very responsive 
to low oxygen stress, The results demonstrate that 
this responsiveness to 0 2 levels is not limited to 
the aleurone system. It is also important to note 
that the Hb gene shows same 0 2 responsiveness 
in at least two different monocot species. 

The expression of an Hb gene in roots and 
aseptically cultured aleurone layers of a monocot 
with no known symbiotic partners strengthens 
the proposition that plant Hbs do have a more 
universal function than their role in symbioses [3, 
31, 42]. If this is true, together with the high level 
of sequence conservation between the barley and 
dicot non-legume Hbs, it provides some circum- 
stantial support for the proposal that even le- 
gumes may still possess a copy of the 'non-Lb 1 
gene, from-which the Lbs themselves have arisen 
for specialist purposes by gene duplication and 
subsequent evolution [3, 31]. Such a conserved 
locus could easily have been overlooked due to 
the well documented problems inherent in cross 
species screening with available Hb probes 
[3, 31], In our own work, we have observed that 
Lb probes and barley Hb probes do not cross 
hybridise even at very low stringency and in the 
presence of a large excess of the target sequence 
and the probe (unpublished data). 

Hb transcription in a well characterised cell 
system* the barley aleurone layer, provides a suf- 
ficiently defined system for transfection experi- 
ments to directly address the question of Hb func- 
tion and its possible role in the anaerobic response 
[21, 22]. A clearer picture of this response is cru- 
cial to a better understanding of the factors which 
lead to flooding tolerance in some cereals and not 
in others [24], We are currently pursuing experi- 
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ments designed to directly assess the physiologi- 
cal significance of the Hb protein. 
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Although nonsymbiotic hemoglobins (Hbs) are found in different 
tissues of dicots and monocots, very little is known about hb genes 
in monocots and the function of Hbs in nonsymbiotic tissues. We 
report the cloning and analysis of two rice {Oryza sativa L.) hb 
genes, hbl and hb2, that code for plant Hbs. Rice hbl and hb2 
genes contain four exons and three introns, as with all of the known 
plant hb genes. At least three copies of the hb gene were detected 
in rice DNA, and analysis of gene expression shows that hbl and 
hb2 are expressed in leaves but only hbl is expressed in roots. A 
cDNA for rice Hb1 was expressed in Escherichia coli, and the 
recombinant Hb (rHbl) shows an unusually high affinity for 0 2 
because of a very low dissociation constant. The absorbance spectra 
of the ferric and deoxyferrous rHbl indicate that, in contrast to 
symbiotic Hbs, a distal ligand is coordinated to the ligand-binding 
site. Mutation of the distal His demonstrates that this residue coor- 
dinates the heme Fe of ferric and deoxyferrous rHbl and stabilizes 
O a in oxy-rHbl. The biochemical properties of rice rHbl suggest 
that this protein probably does not function to facilitate the diffu- 
sion of 0 2 . 



Hbs are widely distributed throughout higher plants, 
including both dicots and monocots. Comparison of pro- 
tein sequences and analysis of gene expression suggest that 
two families of Hbs, the symbiotic and nonsymbiotic, exist 
in higher plants (Appleby, 1992; Andersson et al., 1996). 
Symbiotic Hbs are detected only in N 2 -fixing nodules, but 
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not in other plant organs or in non-N 2 -fixing plants. The 
function of Hbs in legume nodules is to facilitate the dif- 
fusion of 0 2 to the N 2 -fixing bacteroids (Appleby, 1984, 
1992). Nonsymbiotic Hbs are found in both dicot and 
monocot plants, and thus appear to be more widely dis- 
tributed in higher plants than symbiotic Hbs (Appleby, 
1992; Andersson et al., 1996). 

A Hb that exhibits characteristics of both symbiotic and 
nonsymbiotic Hbs was isolated from nodules of Parasponia 
andersonii, a nonlegume infected by rhizobia (Appleby et 
al., 1983). The P. andersonii hb gene is expressed in both root 
nodules and noninfected organs (Bogusz et al., 1988; 
Appleby, 1992), and codes for a protein that has 0 2 -binding 
kinetics similar to symbiotic Hbs, which suggests that both 
Hbs have a similar function in nodules (Gibson et al., 1989). 
The P. andersonii hb gene contains three introns located 
identically to the symbiotic hb genes (Landsman et al., 
1986). A hb gene has also been cloned from Trema tomentosa, 
a nonnodulating relative of P. andersonii (Bogusz et al., 
1988) that is expressed in roots but not in leaves (Bogusz et 
al., 1988). Nonsymbiotic hh genes have been cloned from 
other dicots such as soybean {Glycine max L.) (Andersson et 
al., 1996) and Arabidopsis (Trevaskis et al., 1997). The 
soybean hb gene is similar to nonsymbiotic hbs and it is 
expressed in diverse organs, with the highest level of ex- 
pression detected in stems (Andersson et al., 1996). 

A Hb transcript from a monocot was cloned by Taylor et 
al. (1994) using an aleurone cDNA library from barley 
(Hordeum vulgare L.), and its corresponding gene sequence 
was recently deposited in the GenBank database (accession 
no. U94968). The amino acid sequence of the predicted 
barley Hb is similar to nonsymbiotic Hbs. A single copy of 
the hb gene apparently exists in barley. It is expressed in 
roots of plants grown under normal conditions, but it is 
expressed at higher levels when the plants are grown un- 
der microaerobiosis, suggesting that the expression of the 
hb gene may be associated with the anaerobic response. 



Abbreviations: DDBJ, DNA Data Bank of Japan; EST, expressed 
sequence tags; Hb, hemoglobin; Mb, myoglobin; RGRP, Rice Ge- 
nome Research Program (Japan); rHb, recombinant hemoglobin. 
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Incomplete sequences of cDNAs coding for rice Hbs 
have been deposited in the DDBJ database, and the se- 
quences for the predicted proteins have been published by 
Andersson et al. (1996). Very little information is available 
on the hb genes in monocots and the biochemical properties 
of nonsymbiotic Hbs, and nothing is known about the 
function of these proteins in plants. A nonsymbiotic Hb has 
been isolated from barley seeds (Duff et al., 1997), and 
recombinant nonsymbiotic Hbs have been studied (Duff et 
al, 1997; Trevaskis et al., 1997). These proteins exhibit 
unusual spectra when reduced and unligated and possess 
a very high affinity for 0 2 . In this work we report the 
cloning and analysis of two hb genes, hbl and hbl, from 
rice, 

A cDNA coding for Hbl was expressed in Escherichia coli, 
and the recombinant wild-type protein (rHbl) and a mu- 
tant in which the distal His was replaced by a Leu were 
spectroscopically and kinetically characterized. Our results 
show that rice hb genes are similar to other nonsymbiotic 
hbs, and are differentially expressed in roots and leaves of 
rice plants grown under normal conditions. rHbl shows 
spectral characteristics similar to other Hbs, but the spectra 
for the ferric and deoxyferrous forms are unusual. Further- 
more, despite the very high affinity for 0 2 , rHbl appears to 
function through a unique mechanism in which the distal 
His binds to the heme Fe in the deoxyferrous state, but 
repositions to stabilize bound O z , resulting in an extremely 
low dissociation rate. 



MATERIALS AND METHODS 

Plant Growth and Total DNA and RNA Isolation 

Rice (Oryza sativa var. Jackson) seeds were germinated 
for 5 d and then planted in pots containing vermiculite. 
Rice plants were grown in a greenhouse at 22°C with 
light/ dark periods of 16 h/8 h and watered with tap water 
every 3 d and with nutrient solution (Becana et al., 1991) 
every 6 d. Plants were grown for 5 weeks and then the roots 
and leaves were collected, washed, and immediately fro- 
zen. Total DNA was isolated from roots or leaves using a 
modification of the cetyltrimethylammonium bromide 
method (Doyle and Doyle, 1990). Poly(A + ) RNA was iso- 
lated from rice roots or leaves using a QuickPrep mRNA 
purification kit (Pharmacia) and quantitated by spectro- 
photometry, assuming IA Z60 = 40 /xg/mL (Ausubel et al v 
1995). 



Sequencing of Two cDNAs Coding for Rice Hbl or Hb2 

Rice cDNA clones with sequences similar to plant Hbs 
were generated by the RGRP (Sasaki et al., 1994) and 
deposited in the DDBJ database as EST sequences. We 
obtained the clones C741 and C2576 (DDBJ accession nos. 
D15507 and D38931, respectively), from the RGRP, which 
were fully sequenced. Clones C741 and C2576 were named 
rice Hbl and Hb2, respectively. 



Oligonucleotides and PCR Amplification 

Primers were designed for PCR to amplify the hbl or hbl 
genes using the sequences that are immediately upstream 
and downstream from the start and stop codons of the rice 
Hbl or Hb2 cDNAs. The oligonucleotide sequences were: 
Hbl/ 5' (sense), 5'-TAAACCAGCTGTCAGGAAGCA-3'; 
Hbl/3' (antisense), 5'-AGCAGCT-AGCATGCCTGTCGA- 
3'; Hb2/5' (sense), 5'-AGGAATCAAATCGAAGCAGCC- 
3'; and Hb2/3' (antisense), 5' -GGAGGTGGAGCAGT- 
ATATATA-3'. Total rice DNA (approximately 0.5 /x.g) was 
used as the template for PCR amplification. PCR compo- 
nents and concentrations were: 0.5 ixm of each sense and 
antisense primer, 200 jam of each dNTP, and 0.4 unit of Taq 
DNA polymerase (Gibco-BRL) in lx PCR buffer. PCR was 
done in a final volume of 10 fit using a rapid-cycling 
apparatus (Idaho Technology, Idaho Falls, ID). Amplifica- 
tion was carried out for 35 cycles at 65°C (for hbl) or 60°C 
(for hbl) I '30 s for annealing. PCR products were isolated 
from the agarose gel using the GeneClean kit (Bio 101, 
Bio-Rad) and then cloned into the vector pCRII (Invitrogen, 
San Diego, CA) following standard procedures (Sambrook 
et al., 1989). Cloned fragments were sequenced and DNA 
sequences were compared with sequences deposited in the 
GenBank database using the BLAST program (Altschul et 
al., 1990). Additional computer analyses were done using 
the GCG package (Genetics Computer Group, Madison, 
WI). 



Southern-Blot Analysis 

The clone with the rice hbl gene was labeled with 
digoxigenin-ll-dUTP (Boehringer Mannheim) by PCR as 
described by Lu et al. (1993). Rice total DNA was digested 
with restriction enzymes, and the samples were Southern- 
blotted using standard procedures (Sambrook et al., 1989). 
Membranes were hybridized at 55°C overnight with the 
rice hbl probe, washed at high stringency (60°C) in 2X 
SSC/0.1% (w/v) SDS twice for 5 min each, and in 0.5X 
SSC/0.1% (w/v) SDS twice for 15 min each, and then 
incubated in the NBT/x-Phosphate mix of the Genius kit 
(Boehringer Mannheim) to develop color. 



Expression of hb Genes in Rice Organs 

Expression of the rice hb gene was examined in roots and 
leaves of 5-week-old plants by RNA-PCR (Wang et al v 
1989) using a kit (Cetus). Reverse transcription was done 
using 30 ng of poly(A + ) RNA as the template and oligo 
d(T) 16 as the primer following the manufacturer's protocol. 
PCR amplification was performed using the specific prim- 
ers for the amplification of rice Hbl or Hb2 cDNAs and the 
same conditions used for the isolation of the rice hbl and 
hbl genes (above), but for 40 cycles. For a positive control, 
ubiquitin transcripts were amplified using primers that 
were designed from sequences located at positions 109 to 
130 (sense: 5 ' - ATGC AG ATCTTCGTG A AG ACCC-3 ' ) and 
316 to 336 (antisense: 5 '- ACCTCC ACG A AGGCGC AG- 
GAC-3') of a rice ubiquitin cDNA (Nishi et al., 1993). PCR 
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products were detected in 2% (w/v) agarose gels after 
staining with EtBr. 

Expression of the Rice Hb1 cDNA in Escherichia coli and 
Spectroscopic and Kinetic Characterization of the 
Recombinant Wild-Type and Mutant Proteins 

A cDN A coding for rice Hbl was amplified by PCR using 
the primers 5'-CC4TGGCTCTCGTGGAGGATAAC-3' 
(sense) and 5'- GAA7TCT CACTCCGCGGGCTCCATCTC- 
3' (antisense) that are located at the start and stop codons 
of the hbl gene, respectively, and were degenerated with 
restriction sequences for Ncol or EcoRI (underlined). PCR 
conditions were the same as above. The PCR fragment for 
Hbl was subcloned into the expression vector pET28a (No- 
vagen, Madison, WI) and transformed in E. coli as de- 
scribed by Hargrove et al. (1997) and Arredondo-Peter et 
al. (1997). The Kunkel method (Kunkel, 1985) of mutagen- 
esis was used to generate an H74L (His— »Leu) mutant 
starting from the cloned cDNA for Hbl. Recombinant rice 
Hbs (wild type and mutant) were purified by (NH 4 ) 2 S0 4 
precipitation and chromatography on DEAE-cellulose, and 
then partially sequenced from their N terminus using stan- 
dard procedures (Jun et al., 1994a, 1994b). Pure recombi- 
nant Hbs were spectrophotometrically characterized as de- 
scribed by Arredondo-Peter et al. (1997), and O z - and CO- 
binding affinities were determined as described by 
Hargrove et al. (1997). 

RESULTS AND DISCUSSION 

Sequencing of Two Rice cDNAs That Code for Hbl and 
Hb2 and Analysis of the Predicted Proteins 

Partial sequences (approximately 300 bp) of two cDNAs, 
corresponding to the clones C741 and C2576, with high 
similarity to nonsymbiotic hb genes, have been generated 
by the RGRP program and deposited in the DDBJ database 
(Sasaki et al., 1994). We fully sequenced the clones C741 
and C2576 in both directions, and the sequences were 
compared with sequences deposited in the GenBank data- 
base. Clones C741 and C2576 were found to be highly 
similar to plant Hbs, so they were designated Hbl and 
Hb2, respectively. The rice Hbl and Hb2 clones are 812 and 
786 bp in length, contain putative polyadenylation signals 
at positions 772 and 747, and code for predicted proteins of 
166 and 169 amino acid residues, respectively (Fig. 1). 

Sequence comparisons show that the predicted Hbl and 
Hb2 proteins are 93% similar to each other, and that the 
rice Hbs are 68 to 82% similar to nonsymbiotic Hbs and 
about 50% similar to symbiotic Hbs. Rice Hbs contain distal 
(H77) and proximal (H112) His residues, as well as the P52, 
F58, F82, and F122 that are conserved in plant Hbs (Fig. 2) 
(Arredondo-Peter and Escamilla, 1991). Rice Hbl and Hb2 
also contain a single Cys residue, C86, that is highly con- 
served in nonlegume Hbs (Arredondo-Peter and Escamilla, 
1991; Taylor et al., 1994; Andersson et al., 1996). Andersson 
et al. (1996) reported a second Cys at position 92 (number- 
ing as in Fig. 2) from partial sequences of rice Hbl and Hb2. 
However, after sequencing many clones of rice Hbs we did 



A 

CCACGCGTCCGGTTGTTTTCAGAGCCCAGCTAGCTCTCGATCATTTGTTA 50 

CAGAGAAATTGATCAAAGCAGGAiAT TAAACCAGCTGTCAGGAAGCAA TG 100 

M 1 

GCTCTCGTGGAGGATAACAATGC CGT AGC GGTGAGCTTCAGCGAGGAGC A 1 50 

ALVEDNNAVAVSFSEE Q 18 

GGAGGCGCTGGTGCTCAAGTCATGGGCGATCTTGAAGAAGGATTCCGCCA 200 

EALVLKSWAILKRDSA 34 

ATATTGCCCTCCGCTTCTTCTTGAAGATCTTCGAGGTCGCGCCGTCGGCG 250 

NIALRFFLKIFEVAPSA 51 

AGCCAGATGTTCTCGTTCCTGCGAAACTCCGACGTGCCGCTCGAGAAGAA 300 

SQMFSFLRNSDVPLEKN 68 

CCCCAAGCTCAAGACCCACGCCATGTCCGTCTTCGTCATGACATGCGAGG 350 

PKLKTHAM SVFVMTCE 84 

CCGCCGCGCAGCTGCGGAAAGCCGGGAAGGTCACCGTGAGAGACACCACC 400 

AAAQLRKAGKVTVRDTT 101 

CTCAAGAGGCTCGGCGCCACGCACCTCAAGTACGGCGTCGGAGACGCCCA 450 

L KRLGATHLKYGVGDAH 118 

CTTCGAGGTGGTGAAGTTCGCGCTGCTTGACACGATCAAGGAGGAGGTTC 500 

FEVVKFALLDTI KEEV 134 

CGGCGGACATGTGGAGCCCGGCGATGAAGAGCGCGTGGAGCGAAGCCTAC 550 

PADMWSPAM KSAWSEAY 151 

GACC ACCTGGTC GCTGCC ATC AAGC AGGAGATGAAGCC CGCGGAGTGATC 600 

DHLVA AIKQEMKPAE* 166 

GAC AGGC ATGCTAGCTGCTCC AC C TC CATGATC CTC GCCTC GC GAGTC CG 650 

ATTAGCTTTTGTTGCTTTCAAATGCTCGTTTCATATTCATCGTGTCCCAC 7 00 

AAAAAAAGGAGTGTGTATGTGGTGTACGATTGGTGCACGCTCCTGCTGTT 750 

TTCTTCTC GTGAT AAGACAT AAATAAAGATGGTTTTCT AC GCTAAAAAAA 800 

AAAAAAAAAAAG 812 



B 

CCACGCGTCCGGrK^GTTGAATTGAGCTCGAATTGTACTCGATTCACCA 50 

CACAGAGG^TCAAATCGAAGCAGCCATGGCTCTCGTGGAGGGAAACAAC 100 
MALVEGNN 8 

GGCGTGTC GGG GGG AGCGGT C AGC TTC AGC G AGG AGC AGGAGGCGC TTGT 150 
GVSGGAVSFSEEQEALV 25 

GCTCAAGTCGTGGGCC^T^TGAAGAAGGATTCCGCCAACATTGC^CTCC 200 
LKSWAIMKKDSANIGL 41 

GCTTCTTCTTGAAGATCTTCGAGGTCGCGCCGTCGGCGAGCCAGATGTTC 250 
RFFLKI FEVAPSASQMF 58 

TCGTTCCTGCGCJACTCCGACGTGCCGCTCGAGAAGAACCCCAAGCTCAA 300 
SFLRNS DVPLEKNPKLK 75 

GACCCACGCCATGTCCGTCTTCGTCATGACATGTGAGGCCGCCGCGCAGC 350 
THAMSVFVMTCEAAAQ 91 

TGCGGAAAGCC GGGAAGGTCACCGTGAG AGACACCACCCTGAAGAGGCTC 4 00 

LRKAGKVTVRDTTLKRL 108 

GGCGCCAC GCAC TTC AAGTACGGC GTCGGAGACGCC C ACTTTGAGGTGAC 4 50 

gath fkygvgdahfevt 125 

gaggttcgcgctxx;ttcagacgatx:aaggaggcggitccggtggacatgt soo 

r f a l l e t ikeavpvdm 141 

GGAGCCCCGCGATGAAGAGCGCGTGGAGCGAAGCCTACAACCAACTGGTC 550 

WSPAMKSAWSEAYNQLV 158 

G^GGCCATCAAGCAGGACtATGAAGCCTGCTGAGTGATATATATACTGCT^ 600 

AAIKQEMKPAE* 169 

CACCTCC ATGATCCTCGCTGATC AACTTTCTTGCATTGTGCTCGTTCAAT 650 

attcctcgccccacaaaagggacttttgtcggtgtgtgtatgtccaaatg 7 00 

attaaatcaact<k:tgttttgttctatgtaagatacataac^^ 750 

AAGATGGTTTTCTACATGCAAAAAAAAAAAAAAAAG ' 786 



Figure 1 . Nucleotide and deduced protein sequences of clones C741 
(A) and C2576 (B) that code for rice Hbl and Hb2, respectively. 
Sequences used to design the sense and antisense primers for the 
specific amplification of Hbl and Hb2 are underlined. Putative 
polyadenylation signals are double-underlined. 
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1 50 
Rice Hbl HALVE DNNAV . . . AVSFBKE QBAX.VLR6HA ILKKDSANIA LRFFLKIPBV 
Rice Hb2 HALVEOiraOV SGCAVSK6EB QEAX-VLKSKA IKKKDSANIG LRFFLKIFEV 

Barley Hb KflA AECAW7SKB KBALVLKBWA IKKKDSANLG LRFFLKIPTSI 

Paraaponia Hb MSS SEVHKVFTEE QBALWKAWA VKKKHSAELO LQFFLKIFBl 

Trans Hb MSG SHVDKVPTEB QEALWXSWA VMKKHSABLG LKFFLKXPSX 

CABU&rina Bb2 MS TLKGRQFTBB qbalwksws amkphagblg lkfplkifbi 

Soybean Bb MT TTLSRGFSES QSALWKSWN VHKKNSGELQ LXFFLKIFEl 

CasuariJia Hbl KM/TEX QEALLKQSWB VLKQKIPABS LRLFALILBA 

Cowpea Lbll MVAFSDK QB0I.VH0AYE AFKADIPKTS WFYTTILBK 

Phaaoolua Lbl • • .MOAFTEK QEALVHSSWE AFKGNIPQYS WFYTSXLBX 

soybean lba MVAFTEK QDALVSS8FE AFKAHIPQYS WFYTSILBK 

M. truncatula Lbl • .KSFTDK QEALVNSSYE AFKQNLSGYS VFPYTVILEK 

M. BAtlva Lb MOFTDK QEALVNSSWB SFRQN.PGNS VLFYTIILEK 

Sesb&nla Lb2 MGFTBK QEALVNASYE AFKQKLPGHS VLFYSFH-BK 

* * I * 

51 100 



Rice Hbl APSASQMPSF LRNSDVPLSK KPKLKTHAMS VFVMTCEAAA QLRKAQKVTV 
Rice Hb2 APSASQHFSF I>RNSDVPLEK NPKLKTHAM3 VFVMTCEAAA QLRKAGKVTV 
Barley Hb APSARQMFPF LKDSDVPLET KPKLKTHAV5 VFVMTCEAAA QLRKAOKITV 
Paraaponia Hb APSAKNLFSY JJCDSFVFLBQ NPKLKPHATT VFVMTCESAV QLRKAGKVTV 
Trema Hb APSAXNLTSY LKDSPIPLEQ NPKLKPHAMT VFVHTCSSAV QLRKAQKVTV 
CA3uarina Hb2 APSAQKLFSF LKDSNVPLBR UPKLKSKAMS VFLMTCESAV QLRKAGKVTV 
Soybean Bb APSAQKLFSF LRDSTVPLEQ HPKLKPHAVS VFVMTCDSAV QLRKAGKVTV 
Caauarina Hbl APESKYVFSF tKDSHEIPEN NPKLKAHAAV XFKTICESAT ELRQKGHAVW 
Cowpea Lbl I APAAKNLFSF L . . ANGVDAT NFKLTGHABK LFGLVRPSAA QLRASGGW. 
Pbaaeolua Lbl APAAKNLFSF L. . AHGVDPT KPKCTABAES LFGXVRDSAA QLRAHGAW. 
Soybean lba APAAKDLFSP L. .AHGVDPT HPKLTOHABK LFALVRDS&G QLKASGTW. 
M. trvncatula Lbl APAAKGLFSP LKDSAGV.QO SPQLQAHABK VFGLVRPSAS QLRATGGWL 
M. Sativa Lb AFAAKGMFSF l.DSAGV.QD SPKLQSHAEK VFGKVRpSAA QLRATGGWL 
Sasb&nia Lb2 apaargkfsf lkdsdgvpqh hpslqahaek VFGLVRDSAA qlratgvwl 
4* * * *»***, ** * 

101 150 
Rice Hbl RDTTLKRLGA THLKYGVGDX HFEWRFALL DT IKE EVP AD KHSPAMK3XH 
Rice Hb2 RDTTLKRLGA THFKYGVGOA HFBVTRFALL ETIKBAVPVD MWSPAHKSAW 
Barley Hb RHTTLKRLGG THLKYGVADG HFBVTRFALL BTIKHALPAD KWGPBMRHAW 
Parasponia Hb KBSDLKRIGA XHFKTGWNB BFEVTRFALL ETIKEAVP.B KWSPEMXHAW 
Trema Hb RBSHLKRLGA JHFKNOWNE HFE . TRFALL ETIKEAVP.B KWSPEKKHAW 
Casu&rina Hb2 RESSLKKLGA 5HFKHGVADB BFKVTKFALL ETIKBAVP.K TWSPEMKNAW 
Soybean Hb RESNLKKLGA TH7RTGVANB HFEVTKFALL ETIKEAVP.B MWSPAMKNAW 
C&suaxlna Hbl DNHTLKRLGS IHLKHKITDP HFEVKKGAU, GTIKEAIK.E HWSDBMGCW» 
Cowpea Lbl I ADAAL. . .GA VUSQKAVNDA QFVWKEALV KTLKEAVC.D KWSDBLGTAV 
Phaaeolua Lbl ADAAL. . .OS XHSQKGVNDS QFLWKEALL KTLKEAVG.D KWTDELSTAL 
Soybean lba ADAAL. . .OS vhaqkavtdp QFVWKEALL KTIKAAVG.D kws dels raw 
M, trurtCStula Lbl GDAAL. . .GA JHIQKGWDP HFVWKEALL KTIKBAAS. D KWS EELS TAW 
H. aativa Lb GDATL. . .GA IHIQKGWDP HPAWKEALL KTIKEVSG.D KWSEELNTAW 
Sesbania Lb2 ADASL. . ,GS VHVQKOVLOP HFVWKEALL KTLKEAAG.A TWSDEVSHAW 



151 170 
Rice Hbl SBAYDHLVAA IKQEKKPAE 
Rice Hb2 SBAYUQLVAA IKQEKKPAE 

Barley Hb geaydqlvaa IKQEKKPAE. 
Parasponia Hb GVAYDQLVAA IKFEHKPSST 
Trema Hb GEAYDQLVAA IKSEMKFSST 
Caauarina Bb2 GEAYDKLVAA IKLEKKPSS. 

Soybean Hb GBAYDQLVDA IKSEMKPPSS 
Caauarina Hbl TEAYHQLVAT XKAEKKE . . . 

Cowpea Lb I J ELAYDBLAAA XKKAY 

Phaaoolua Lbl BLAYDSLAAA IKKAYA. . . . 

Soybean lba EVAYDBLAAA XKKA 

H. trvncatula Lbl EVAYDALATE XKKAKS .... 

M. aativa Lb EVAYDALATA IKKAHV 

Sesbonia Lb2 EVAYDGLSAA XKKAMS 

** • ** 

Figure 2. Sequence alignment of nonsymbiotic and selected symbi- 
otic Hbs. Distal (H77) and proximal (H112) His residues are shown 
in bold type, and Cys residues are underlined. Intron I is at position 
47, intron II is at position 85, and intron 111 is at position 1 25; asterisks 
show the most conserved residues. Amino acid sequences were 
obtained from the GenBank database using the following accession 
numbers: U01 228 (barley Hb), M36509 [Parasponia andersonii Hb), 
Y00296 {Trema tomentosa Hb), X53950 {Casuarina gfauca Hb2), 
U47143 (soybean Hb), L28826 (C. glauca HbD, U33207 (cowpea 
Lbll), K03152 {Phaseolus vulgaris Lbl), V00453 (soybean Lba), 
X57732 {Medicago truncatula Lbl), M32883 (M. sativa Lb), and 
X13815 (Sesbania rostrata Lb2). Alignment of sequences was done 
using the PileUp routine of the GCG program. 



not detect a Cys C92, but rather a Leu L92, which is highly 
conserved in plant Hbs (Fig. 2). Thus, our results support 
the conclusion that rice Hbs contain only one Cys residue, 
which is located at position 86. 



Cloning and Analysis of Rice hb Genes 

To clone the gene for rice Hb, we used total rice DNA as 
the template with specific primers for rice Hbl or Hb2, 
which generated PCR products of approximately 900 bp. 
The PCR products were purified, cloned, and sequenced 
(Fig. 3), and the coding sequences of the PCR products 
were identical to cDNA sequences for Hbl and Hb2; there- 
fore, they were the rice hbl and hb2 genes. 

The hb gene and cDNA sequences were compared to 
identify the exon and intron (IVS) sequences of the rice hb 
genes. Rice hb genes have four exons and three introns, 
with the introns located at the same position as all of the 
known plant hb genes. The exon /intron boundaries of the 
rice hb genes are identical to the boundaries of the Paras- 
ponia andersonii hb gene (Appleby et al., 1988) (Fig. 3). 
Homologous introns of rice hbl and hbl were very con- 
served, with 92% similarity for IVS-II, 84% similarity for 
IVS-III, and 72% similarity for IVS-I. The highly conserved 
location of the introns and sequences of the exon /intron 
boundaries in monocot and dicot hb genes suggest that the 
ancestral hb gene of flowering plants had three introns in 
an identical location. 

When rice DNA was digested with restriction enzymes 
and then subjected to Southern blotting with the rice hbl 
probe at high stringency, between one and three hybridiz- 
ing fragments were detected (Fig. 4). Rice hb genes have no 
restriction sites for the enzymes used to cut the DNA, so at 
least three copies of the hb gene exist in rice. The existence 
of cDNAs encoding for Hbl and Hb2 (Fig. 1) indicates that 
hbl and hbl are functional genes in rice. 

Expression of hb Genes in Rice Organs 

In contrast to symbiotic hb genes, which are expressed 
only in nodules of N 2 -fixing plants, nonsymbiotic Hb tran- 
scripts have been reported to exist in many tissues, includ- 
ing: (a) root meristems of Trema tomentosa (Bogusz et al., 
1988), (b) root vascular bundles of transgenic tobacco (Nico- 
tiana tabacum) (Bogusz et al., 1990), (c) seed aleurone and 
roots grown under microaerobiosis of barley (Hordeum vul- 
gare) (Taylor et a'L, 1994), and (d) diverse organs of soybean 
(Glycine max) (Andersson et al., 1996) and Arabidopsis 
(Trevaskis et al., 1997). 

To determine the pattern of hb gene expression in rice, 
we isolated poly(A + ) RNA from roots and leaves, and then 
subjected it to PCR using specific primers for rice Hbl or 
Hb2. Amplification products were detected for Hbl in 
roots, and for Hbl and Hb2 in leaves (Fig. 5). Hb transcripts 
of approximately 550 bp were cloned and sequenced / and 
the resulting sequences were identical to those of the rice 
Hbl and Hb2 cDNAs, indicating that hb genes are detect- 
able and functional in rice roots and leaves. We did not 
detect any Hb2 transcripts in rice roots using our protocols, 
suggesting that the hbl gene is probably not expressed in 
the roots of rice grown under normal growth conditions. 
The pattern of expression of rice hb genes is similar to the 
expression of hb genes in Arabidopsis reported by 
Trevaskis et al. (1997). The differential expression of hbl 
and hbl genes in the rice plant indicates that these genes 
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hbl 1 ATOGClCTCGrpGCAOGATAACAATCCCGT . AGCGGTGAGCTT 41 

IIIIIIIIMIfllfl Mill i III mm Mill 

hb2 I ATCGCTCTCGTGGAGGGAAACAACGGCGTCTCGGGGGGAflCGGTCAGCTT SO 
42 CACCOAOGlAGCAGGAGOC^TCMTGCTCAAOTCATOGOCGATCTTGiAAOA 91 

lllllllllllllllllllll lllllliillt Mill Ml MMII 

51 CAGCGAGGAGCAGGAGGCGCTTGTGCTCAAGTCQTGGGCCATCATGAAGA 100 
9 2 AGGATTCCGCCAATATTGXXICTCCGCTTCTTCTTGAA^tatgtac . . atg 139 

1 1 j 1 1 1 1 1 1 1 ! 1 1 MM illllllllllllMMIIIIIIII I 

101 A6GATTC<^CCAACATTGGACTCCGCTTCTTCTTG AAjjt a tg tact a eta 150 
140 cqtgttacta ccattt ctctttttgcggaatc 171 

i ii inn 1 1 1 1 1 1 ii in in i i 

151 catgctactactcactagtgccatttttcgcagagtcagtttacggcacc 200 
1*)2 agagattgggtt. tgtgaagcat . , taaattgagcaatgcat . .ttcgct 216 

i him n in mi n in i in i ii i i 

201 aaccattggtttctgtagtgcatcatacattttggtgtgcgtgatttggt 250 
217 gatacatgtgtgtctgattgtgttgta^^TCTTCGAGGTCGCGCCGTCG 266 

I III I I II I M IIMilllllHMIIIIIIMIf 

251 gctac tatatttttattccagGATCTTCGAGGTCGCGCCGTCG 293 

267 GCGAGCCAGATGTTCTCGTTCCTGCGAAACTCCGACGTGCCGCTCGAGAA 316 

IINIilillUinnMIIMMI IIMMnMniMlllllMI 

294 GCGAGCCAGATGTTCTCGTTCCTGCGCAACTCCGACGTGCCGCTCGAGAA' 343 
317 GAACCCCAAGCTCAAGACCCACGCCATGTCCGTCTTCGTCATCgitaatac 366 

ii Nut ii nun n i iimii imniMiitmmumi 

344 GAACCCCAAGCTCAAGACCCACGCCATGTCCGTCTTCGTCATG^taatac 393 
367 taccatcattat . . . ttcaggca . . agtaaatttgttgtggtagtagaca 411 

MINIM 111 I III II IMIIMIIMIIl lllllll II 

394 taccatcagtat^gctgcagacaagagtaaatttgttgtagtagtaggca 443 
412 ctgacagaatgtgtgcgtgcgtcgcgatcaatcgatattgcagACATGCG 461 

I M I ! ! M 1 1 1 1 1 II M 1 1 MM IMMMIMI IMIMMII! I 

444 ctgacagaatgtgtgcgtgtgtcgtgatcaatcgatgttgcagACATGTG 493 
462 AGGCCGCCGCGCAGCTGCGGAAAGCCGGGAAGGTCACCGTGAGAGACACC 511 

1 1 1 1 1 1 1 1 1 1 ! II M I! II 1 1 1 1 1 1 1 II 1 1 M I II 1 1 1 M M 1 1 1 M I II 

494 AGGCCGCCGCGCAGCTGCGGAAAGCCGGGAAGGTCACCGTGAGAGACACC 543 
512 ACCCTCAAGAGGCTCGGCGCCACGCACCTCAAGTACGGCGTCGGAGACGC 561 

I Mi I Ml 1M MMII MM II II I M 1 1 1 1 1 i! 1 1 U 1 1 1 1 II II I 

544 ACCCTOAAOAGGCTCG(XKXX:ACGCACTTCAAGTACGGCGTCGGAGACGC 593 
562 CCACTTCGAGgtacagtgatccccaatggctgcctgcgctccatitcgatc 611 

HUM MMII MUM III I I I 1 1 1 1 1 M 1 1 1 1 1 1 

594 CCACTTTGAGgtagagtgatatccagttgtt. .... .gctccattcgatc 637 

612 gacatgaaac. . . .ttgatcgttttctgatcgtgtctttgtcgaacaacg 657 

1 1 1 1 1 1 in i nun iimii m i mini! 

638 gacatggaactctatcgatcgtgttctgattgtggccttgtcgaa 682 

658 tacatgcgatcgwtcgatcgtgtawaca^GTGGTGAAGTTCGCGCTGCTT 707 

Ihll 1 1 1 : 1 1 1 ; 1 1 ! M MMMMIMM 

683 cgttc gtaaac agGTG ACQAGGTTCGCGCTGCTT 716 

708 GACACGATCAAGGAGGAGGTTCCGGCGGACATGTGGAGCCCGGCGATGAA 757 

II MIIMMIlttl MIIIMi I MUM M 1 1 1 i M MMMM 

717 GAGACGATCAAGGAGGCGGTTCCGGTGGACATSTGGAGCCCCGCGATGAA 766 
758 GAGCGCGTGGAGCGAAGCC TACGACCACCTGGTCGCTGCCATC AAGCAGG 807 

MIMIIIMIIIMMIIMI Mil MMMM M 1 1 II II II II I 

767 GAGCGCGTGGAGCGAAGCCTACAACCAACTGGTCGCGGCCATCAAGCAGG 816 
808 AGATGAAGCCCGCGGAGTGA 827 

illlllllll II MMM 

817 AGATGAAGCCTGCTGAGTGA 836 

Figure 3. Sequence alignment of rice hbl and hb2 genes. Coding 
and noncoding sequences are shown as upper or lowercase, respec- 
tively. Sequences flanking the exon/intron boundaries that are con- 
served in rice and P. andersonii sp. (Appleby et al., 1988) hb genes 
are underlined. 



are not linked to each other, and that each hb gene is 
probably regulated by different promoter sequences and 
rnrns-acting factors. 




Figure 4. Southern-blot analysis of total rice DNA using the rice hbl 
gene as a probe. Signals were detected by colorimetry using the 
chromogenic mix NBT/x-phosphale. Molecular size markers are 
shown in ki lobars. 



Expression of the Rice Hbl cDNA in E. coli and 
Spectroscopic and Kinetic Characterization of the 
Recombinant Wild-Type and Mutant Protein 

Nonsymbioric Hbs are of low abundance in plant tissues 
and thus are difficult to isolate and purify from plants. The 
synthesis of recombinant proteins provides a useful 
method of producing large amounts of protein for bio- 

Hoots .Leaves 
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Figure 5. Agarose gel electrophoresis of the RNA-PCR products that 
were obtained using as a template poly(A* ) RNA isolated from rice 
roots or leaves and specific primers for rice Hb1 or Hb2. Arrow 
shows the approximately 550-bp fragments that were cloned and 
sequenced. Rice ubiquitin (Ubi) was used as positive control. Mo- 
lecular size markers are shown in base pairs. 
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Figure 6. Absorption spectra of rice wild-type rHbl (A) and the H77L 
mutant of rHbl (B). Dashed lines, Ferric; solid lines, oxygenated; and 
combination dashed/solid lines, ferrous forms of Hb. 

chemical studies. We prepared a recombinant Hb by sub- 
cloning a cDNA for rice Hbl into the vector pET28a and 
then expressing it in E. coli. The N terminus of the rHbl 
was determined to be ALVEDNNA VAV, which is identical 
to the predicted sequence of the Hbl cDNA (Fig. 1A), 
indicating that the correct recombinant protein had been 
synthesized. 

Analysis of the rHbl shows that it exhibits spectral char- 
acteristics that are similar to Hbs (Fig. 6A), including re- 
versible binding of 0 2 . However, rHbl exhibits some dis- 



tinctive absorption bands both in the ferric and the 
deoxyferrous forms. Differences in the globin-heme link- 
age were apparent from the deoxyferrous spectrum be- 
cause, at pH 7.0, the unligated ferrous state exhibits two 
peaks at 526 and 556 nm, which is similar to the absorption 
spectra of Cyt b, where the heme is hexa coordinate and the 
Fe is principally in the low-spin state (Smith, 1978; Weiss 
and Ziganke, 1978). These spectra are in marked contrast to 
the symbiotic plant Hbs and animal Mbs and Hbs, in which 
the Fe is pentacoordinate and displays a broad peak cen- 
tered at 556 nm in their deoxyferrous form (Appleby, 1974, 
1992). Thus, it is apparent that the ferric and deoxyferrous 
forms of rHbl contain a distal ligand that was identified as 
His74. Absorbance spectra of the ferric, oxy-, and deoxy- 
ferrous forms of an H74L mutant of rHbl shows no evi- 
dence of His coordination (Fig. 6B), and the ferric peak at 
405 nm suggests that the ligand-binding site is partially 
occupied by a water molecule. Furthermore, the addition of 
exogenous imidazole to ferric or deoxyferrous H74L mu- 
tant results in a spectrum identical to the corresponding 
form of the wild-type rHbl. 

Kinetic analysis of ligand binding shows that the rHbl 
has an unusually high affinity for 0 2 (Table I). The 0 2 - 
association constant of rHbl is similar to other O z storage 
and transport proteins, such as soybean Lba, however, the 
high affinity of rHbl for 0 2 results from a very low disso- 
ciation constant. Similar values for the dissociation con- 
stant have been reported for barley (Duff et al., 1997) and 
Arabidopsis (Trevaskis et al., 1997) recombinant Hbs, and 
suggest that these proteins are not involved in 0 2 transport 
through facilitated diffusion. Rate constants of rice rHbl 
for the reaction with CO are similar to 0 2 -transport pro- 
teins, which indicates that the unique O z reactivity is a 
result of specific interactions between 0 2 and the protein. 

Removal of His74 had a profound effect on the rate 
constants for 0 2 binding. The association constant in- 
creases approximately 9-fold as a result of the H74L mu- 
tation, and the dissociation constant increases nearly 1000 
times. This is similar to the effect of removal of the distal 
His in Mb (Springer and Sligar, 1987), but is different from 
the smaller effect seen with the same mutation in soybean 
Lba (Hargrove et al., 1997) (Table I). These results suggest 
that His74 forms a strong H + bond with bound 0 2 , which 
contributes to the slow dissociation constant. However, the 
rate constant for the mutant, which is similar to soybean 
Lb, combined with the extremely low value for the wild- 



Table I. Rate and equilibrium constants for the reaction of 0 2 and CO with rice wild-type rHbl and 
the H77L mutant of rHbl 
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b Dissociation constant. 


c Ligand affinity. 
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type rHbl, suggests that there might be an additional 
mechanism limiting 0 2 dissociation. 

Nonsymbiotic Hb from rice, and presumably all Hbs of 
this class, appear to operate through a unique mechanism. 
His H74 coordinates the heme Fe in the deoxy-protein, but 
when 0 2 binds the side chain moves to a position from 
which it can form a stabilizing interaction with bound 0 2 . 
This is very different from the behavior of Mb mutants, in 
which a His coordinates the heme Fe. These proteins are 
very unstable and do not react with 0 2 due to an extraor- 
dinarily high autooxidation rate (Duo et al., 1995). 

Function of Nonsymbiotic Hbs 

The function of plant Hbs in nonsymbiotic tissues is not 
known; however, it has been hypothesized that the role of 
nonsymbiotic Hbs is probably not to facilitate the diffusion 
of 0 2 in roots, but rather to sense levels of 0 2 (Appleby et 
al., 1988). Appleby et al. (1988) suggested that under nor- 
mal aerobic conditions Hb would be oxygenated, and that 
under 0 2 ~Hmiting conditions, deoxyferrous Hb levels 
would increase and trigger an anaerobic response. Al- 
though 0 2 -sensor heme proteins have been described in 
other systems (Gilles-Gonzalez et al., 1991, 1994, 1995; 
Gilles-Gonzalez and Gonzalez, 1993), the above hypothesis 
was questioned recently by Andersson et al. (1996), who 
suggest that Hbs may function as 0 2 carriers in metaboli- 
cally active tissues. 

Our results with rice plants, including the expression 
profiles in aerobic tissues (Fig. 5), expected levels of Hbs in 
tissues, and kinetic parameters of Hbl (Table I), suggest 
that these proteins are unlikely to facilitate the diffusion of 
0 2 . Also, because of the extremely low dissociation con- 
stant exhibited by rHbl, it is unlikely that rice Hbs function 
as an effective 6 2 donor to other proteins, such as mito- 
chondrial oxidases. However, under specific circum- 
stances, such as in barley roots grown under microaerobic 
conditions, in which high levels of Hb transcripts have 
been detected (Taylor et al., 1994), nonsymbiotic Hbs may 
have other functions, including participation in the anaer- 
obic response and possibly in specific metabolic aspects of 
dedifferentiated tissues. Recently, a multitude of functions 
have been suggested for nonplant Hbs, which includes the 
transport of ligands other than 0 2 (such as NO or CO), 
interaction with small organic molecules, 0 2 scavenging, or 
formation of complexes with regulatory proteins (Gdtz et 
al, 1994; Giardina et al., 1995; Goldberg, 1995; Jia et al., 
1996). The pattern of expression of nonsymbiotic hb genes 
in plants and the biochemical properties of the recombi- 
nant Hbs suggest that these proteins have other functions 
besides 0 2 transport, which are yet to be determined. 
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ABSTRACT We cloned two hemoglobin genes from Ara- 
bidopsis thaliana. One gene, ABB1, is related in sequence to the 
family of nonsymbiotic hemoglobin genes previously identified 
in a number of plant species (class 1). The second hemoglobin 
gene, AHB2, represents a class of nonsymbiotic hemoglobin 
(class 2) related in sequence to the symbiotic hemoglobin 
genes of legumes and Casuarina. The properties of these two 
hemoglobins suggest that the two families of nonsymbiotic 
hemoglobins may differ in function from each other and from 
the symbiotic hemoglobins. AHB1 is induced, in both roots and 
rosette leaves, by low oxygen levels. Recombinant AHB1 has an 
oxygen affinity so high as to make it unlikely to function as an 
oxygen transporter. AHB2 is expressed at a low level in rosette 
leaves and is low temperature-inducible. AHB2 protein has a 
lower affinity for oxygen than AHB1 but is similar to AHB1 in 
having an unusually low, pH-sensitive oxygen off-rate. 



Leghemoglobins, the first hemoglobins to be identified in 
higher plants (1), are found in the nodules of legumes where 
they are believed to transport oxygen to nitrogen-fixing en- 
dosymbiotic bacteria (2). Nodule-expressed hemoglobins also 
have been identified in Casuarina glauca, a nitrogen-fixing 
nonlegume (3). Collectively, these nodule-expressed hemoglo- 
bins are called "symbiotic hemoglobins'* (4). In addition to 
symbiotic hemoglobins, these plants have another hemoglobin 
that shows only limited amino acid sequence similarity to the 
symbiotic hemoglobins. Genes encoding this type of hemoglo- 
bin also have been cloned from the nitrogen-fixing elm Paras- 
ponia andersonii (5, 6) and from plants that do not fix nitrogen, 
including monocots (6-8). These "nonsymbiotic" hemoglo- 
bins are typically expressed at low levels in root tissue. The 
Parasponia hemoglobin is unusual in that it appears to be 
bi-functional; it transports oxygen in nitrogen-fixing nodules 
and, like the nonsymbiotic hemoglobins, is expressed in root 
tissue (6). The function of nonsymbiotic hemoglobins in nor- 
mal plant tissues has not been established. To examine the 
function of nonsymbiotic hemoglobins, we have cloned hemo- 
globins from Arabidopsis thaliana. 

MATERIALS AND METHODS 

Plant Growth and Stress Treatments. Arabidopsis C24 
ecotype seeds were surface sterilized, then grown on MS agar 
with 3% sucrose, 40-50 plants/plate, at 22°C (16/8 h light/ 
dark cycle) until harvest. Plants used for AHB1 induction 
analysis were grown on agar without sucrose. Dehydration, 
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chilling, and hypoxia treatments were performed as outlined in 
Dolferus et al (9). All treatments, except chilling (24 h), were 
for 12 h unless specified. Sucrose treatment involved placing 
plantlets in MS with 1 % sucrose overnight. RNA was extracted 
using the method of Logemann et al. (10) and separated on 
formaldehyde gels as described by Dolferus et al. (9); RNA gel 
blotting, hybridization, and posthybridization washes also were 
done as described by Dolferus et al. RNA loadings were 
calibrated with an Arabidopsis Ubiquitin riboprobe. 

Library Screening, Subcloning, and Sequencing. Arabidop- 
sis genomic libraries (9- to 15-kb fraction of 5aw3A-digested 
C24 genomic DNA in AEMBL4) were screened with random 
primed probes (11). Screening and subsequent subcloning 
involved standard conditions and DNA manipulation, as de- 
scribed in Ausubel et al (11). Automated DNA sequencing, 
with both dye primer and dye terminator reactions, was 
performed according to company specifications (Perkin- 
Elmer). 

Reverse Transcriptase-PCR Analysis. Reverse transcrip- 
tase-PCR was carried out on first strand cDNA that was 
synthesized from RNA with Moloney murine leukemia virus 
reverse transcriptase as per company specifications (Pro- 
mega). PCR was then performed with specific primers (100 
nM) and standard PCR conditions (11). The resulting reverse 
transcriptase-PCR products were cloned and sequenced, con- 
firming that the predicted introns are absent from the final 
transcript. 

Southern Analysis. Genomic DNA was extracted using the 
method of Taylor and Powell (12). Hybridization and post 
hybridization treatment was as described in Ausubel et al. (11). 

Database Search. Database searches were conducted 
through key word searching of the Expressed Sequence Tag 
database on the National Center for Biotechnology World 
Wide Web site (http://www.ncbi.nlm.nih.gov). 

Expression, Purification, and Analysis Of Recombinant 
Proteins. cDNA clones of AHB1 and AHB2 were fused to an 
efficient ribosomal binding site, as per Springer and Sligar (13), 
and inserted into the pET24(+) expression vector (Novagen). 
Escherichia coli, strain BL21(DE3), containing each expres- 
sion vector was grown to OD 60 onM of 0.6, and then recombinant 
protein expression was induced with 1 mM isopropyl /3-D- 
thiogalactoside. After 4 h at 37°C, cells were centrifuged and 
resuspended in 100 mM Tris-HCl (pH 8.0), sonicated, and 
cleared by centrifugation; 50 mg/ml hemin-HCl was added to 
the soluble fraction. Recombinant AHB1 protein was purified 
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d II ► 70 

*$ C2 CD1 E7 

Barley . SAAEGAWF SEEKEALVLK SWAIMKKDSA NLGLRFFLKI FEIAPSARQM PPFLRDSDVP LETNPKLKTH 

Casnon . SSTLEGRGF TEEQEALWK SWSAMKPNAG ELGLKFFLKI FEIAPSAQKL FSFLKDSNVP LERNPKLKSH 

Soynon .TTTLE.RGF SEEQEALWK SWNVMKKNSG ELGLKFFLKI FEIAPSAQKL PSFLRDSTVP LEQNPKLKPH 

Parasp .SSSEVNKVF TEEQEALWK AWAVMKKNSA ELGLQFFLKI FEIAJPSAKNL PSYLKDSPVP LEQNPKLKPH 

AHR1 . . E5EGKTVF TEE QEALWK SWSVMKKNSA ELCLKLFIKI FEIAPTTKKM FSFLRD5PIP AEONPKLKPH 

AHB2 GEIGF TEKQEALVKE SWEILKQDIP KYSLHFFSQI LEIAPAAKGL PSFLRDSDEV PHNNPKLKAH 

Cassym AL TEKQEALLKQ SWEVLKQNIP AHSLRLFALI IEAAPESKYV FSFLKDSNEI PENNPKLKAH 

Soylba GAF TEKQEALVSS SFEAFKANIP QYSWFYTSI LEKAPAAKDL FSFL--ANGV DPTNPKLTGH 

12 13 ^ 140 

F8 

Barley AVSVFVMTCE AAAQLRKAGK ITVRETTLKR LGGTHLKYGV ADGHFEVTRF ALLETIKEAL PADMWGPEMR 
casnon AMSVFLMTCE SAVQLRKAGK VTVRESSLKK LGASHFKHGV ADEHFEVTKF ALLETIKEAV P-ETWSPEMK 
soynon AVSVFVMTCD SAVQLRKAGK VTVRESNLKK LGATHFRTGV ANEHFEVTKF ALLETIKEAV P-EMWSPAMK 
Parasp ATTVFVMTCE SAVQLRKAGK VTVKESDLKR IGAIHFKTGV VNEHFEVTRF ALLETIKEAV P-EMWSPEMK 
AHR1 AM5VFVMCCE SAVQLRKTGK VTVRETTLKR LGASHSKYGV VDEHFEVAKY ALLETIKEAV P-EMWSPEMK 
AHB2 AVKVFKMTCE TAIQLREEGK VWADTTLQY LGSIHLKSGV IDPHFEWKE ALLRTLKEGL G - EKYNEEVE 
cassym AAVIFKTICE SATELRQKGH AVWDNNTLKR LGSIBLKNKI TDPHFEVMKG ALLGTIKEAI K-ENVJSDEMG 
soylba AEKLFALVRD SAGQLKASGT W-AD-A- -A LGSVHAQKAV TDPEFVWKE ALLKTIKAAV G-DKWSDELA 
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Fig. 1 . (a) Alignment of some plant hemoglobin protein sequences. Proximal and distal histidines, phenylalanine CD1 , and proline C2 are shown 
in bold. The location of primers used to amplify AHB1 PCR fragment are shown as arrows. $ represents the 5' end of the AHB2 cDNA (Clone 
ID 11161). Intron positions are marked with stars and labeled 11-13. Class 1 and 2 hemoglobins are separated by the horizontal line, (b) Cladistic 
analysis of plant hemoglobins. The left tree is a similarity tree of plant hemoglobin protein sequences created using the Genetics Computer Group 
pileup program. Class 1 and 2 hemoglobins form two distinct clades. The right tree is one of the two most parsimonious trees (CI .855, RC .654) 
generated by an exhaustive search of all possible trees using the paup, Ver. 3.1.1 program. The two trees of maximum parsimony differed in whether 
they grouped the Casuarina symbiotic hemoglobin or AHB2 with the leghemoglobins (represented by the arrow). The bootstrap value for the basal 
branch of this tree is 100%. The two classes of plant hemoglobins are highlighted by boxes. Hemoglobins (and GenBank accession numbers): barley: 
barley nonsymbiotic hemoglobin (U01228); casnon: Casuarina nonsymbiotic hemoglobin (X53950); soynon: soybean nonsymbiotic hemoglobin 
(U47143); parasp: Parasponia hemoglobin (U27194); AHB1: This study (U94998); AHB2: this study (U94999); cassym: Casuarina symbiotic 
hemoglobin (77695); soylba: soybean leghemoglobin A (J01299); medslba: Medico sativa leghemoglobin A (X14311); luplbl: lupin leghemoglobin 
1 (Y00401). 



by 25-70% ammonium sulfate fractionation, resuspended in 20 
mM Tris-HCl (pH 8.0), and dialyzed twice into 4 liters of the 
same buffer. Dialyzed extracts were concentrated (YM-10 
membrane, Amicon) and passed over a cation exchange col- 
umn (DE52 Whatman) in 20 mM Tris*HCl (pH 8.0). Red 
fractions were concentrated, adjusted to pH 6.0 with dilute 
acetic acid, and passed over an anion exchange column (CM52, 
Whatman) with 20 mM sodium phosphate buffer (pH 8.0). The 
red fraction was concentrated to 10 mg/ml and stored at 
-80°C. Recombinant AHB2 protein was purified as was 
described for AHB1, with the following modification: 25-90% 
ammonium sulfate fractionation. Elution from DE52 resin was 
achieved with 250 mM Tris-HCl, following 5 column volumes 
of 100 mM Tris-HCl wash buffer. After elution from CM52 



resin, red fractions were adjusted to 25% ammonium sulfate 
saturation (pH 8.0) and passed over a phenol-Sepharose 
column (equilibrated with 20 mM Tris'HCl/25% ammonium 
sulfate) then eluted with 20 mM Tris-HCl/12.5% ammonium 
sulfate and concentrated to 10 mg/ml for storage. Rate 
constants for O2 and CO binding were measured using rapid 
mixing and laser photolysis methods (14). 

RESULTS 

A Nonsymbiotic Hemoglobin Gene from Arabidopsis. Using 
degenerate primers that amplified a nonsymbiotic hemoglobin 
gene fragment from soybean (7), we amplified a 459-bp 
fragment of a nonsymbiotic hemoglobin gene from Arabidop- 
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FIG. 2. Southern blots of AHB1 and AHB2. Arabidopsis (C24 
ecotype) genomic DNA digested with BarnHl, Oral, £coRI, and Xbal 
(lanes 1, 2, 3, and 4, respectively) and hybridized with AHB1 and AHB2 
cDNA probes. (Extra bands on the Xbal lanes are due to partial 
digestion.) 

sis. This gene fragment was used to isolate a full-length 
hemoglobin gene, AHB1, with a predicted amino acid se- 
quence that shows strong homology to the sequences of 
nonsymbiotic hemoglobins from other plants (Fig. 1 a and b), 
including predicted ORFs containing amino acid residues that 
are conserved in all plant hemoglobins [the proximal F8 and 
distal E7 histidines, phenylalanine CD1 and proline C2 (15)]. 
There are three introns in positions identical to the introns 
occurring in all other plant hemoglobins (Fig. la). Southern 
analysis, at high stringency, suggests this gene is present in 
single or low copy number in the Arabidopsis genome (Fig. 2). 

A Leghemoglobin-Like Gene from Arabidopsis. We identi- 
fied a second hemoglobin gene (AHB2) by searching the 
Arabidopsis Expressed Sequence Tag database. The single 
partial length cDNA showing homology to plant hemoglobins 
(clone ID 1 1161) does not correspond in sequence to the AHB1 
gene, with only 69% identity at the nucleotide level. This gene, 
AHB2, has three introns in identical positions to those of other 
plant hemoglobins and, UkeAHBl, appears to be a single copy 
gene according to Southern blot data (Dral cuts within the 
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Fig. 4. Time course analysis of the AHB1 hypoxia response. Total 
RNA from whole plants treated with 5% oxygen for varying times 
hybridized with either AHB1 or ADH antisense riboprobes. 

coding sequence of AHB2, producing two bands in this lane; 
Fig. 2). When the amino acid sequence of AHB2 is aligned with 
other plant hemoglobin protein sequences, it appears to be 
more similar to the symbiotic hemoglobins of Casuarina and 
legumes than it does to nonsymbiotic plant hemoglobins (Fig. 
la). If the AHB2 protein sequence is used in a blast (basic 
local alignment search tool) search, it consistently detects 
symbiotic hemoglobins with higher blast scores than most 
nonsymbiotic hemoglobins. A similarity tree of plant hemo- 
globin protein sequences, constructed with the PILEUP function 
of the Genetics Computer Group package, also shows AHB2 
to be more similar to the symbiotic hemoglobins of legumes 
and Casuarina (Fig. lb). The same clade occurs in the two trees 
of maximum parsimony constructed using the PAUP program, 
Ver. 3.1.1, one of which is presented (Fig. lb). The bootstrap 
value for the basal branch of this clade is high (100%), 
supporting the grouping of AHB2 with the symbiotic hemo- 
globins of legumes and Casuarina. 

AHB1 Is Expressed in Root Tissue and Is Hypoxia- 
Inducible. AHB1 expression, examined with RNA gel blots, is 
detectable at low levels in root tissue and is strongly induced 
in both the roots and rosette leaves of plants subjected to 
hypoxic conditions (Fig. 3). The time course of the AHB1 
response to low oxygen levels is similar to that of the alcohol 
dehydrogenase gene (ADH), a well characterized hypoxic 
response gene (ref. 16; Fig. 4). The maximum level oiAHBl 
transcript was observed after 6-8 h of hypoxia treatment. 
Transcription of AHB1 is induced under a 5% oxygen envi- 
ronment, but stronger induction occurs at 0.1% oxygen (Fig. 
5). A transcriptional response also was elicited by a 1% sucrose 
solution. Cold, dehydration, heat shock, oxidative stress, and 
wounding treatments did not affect AHB1 transcript levels. 

AHB2 Is Expressed in Rosette Leaves. A low level oiAHB2 
expression was found in rosette leaves, but unlike AHB1, there 

1 2 3 4 5 6 




Fig. 3. AHB1 expression in root and rosette tissues. Total RNA 
extracted from rosette leaves (lane 1), rosettes treated with 5% oxygen 
(lane 2), roots (lane 3), and roots treated with 5% oxygen (lane 4) 
hybridized with an AHB1 antisense riboprobe. 



Fig. 5. AHB1 expression during environmental stress treatments. 
Total RNA from control plants (lane 1) or plants subjected to chilling 
(lane 2), \% sucrose (lane 3), dehydration (lane 4), 5% oxygen (lane 
5), or 0.1% oxygen (lane 6) hybridized with an AHB1 antisense 
riboprobe. 
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Fig. 6. Induction of AHB2 by low temperature. Total RNA from 
root tissue (lanel) or rosette leaves (lane 2) and root tissue after 
chilling treatment (lane 3) or rosette leaves after chilling treatment 
(lane 4) hybridized with an AHB2 antisense riboprobe. 

was no detectable expression in root tissue. AHB2 expression 
was not affected by sucrose or hypoxia treatment, but low 
temperature did increase AHB2 transcript levels (Fig. 6). The 
low temperature response seems to be confined to the rosette 
leaves with no response evident in root tissue (Fig. 6). Wound- 
ing, heat shock, oxidative stress, and dehydration had no effect 
on the level of AHB2 transcript. 

Recombinant AHB1 and AHB2 Proteins Have Different 
Oxygen Binding Properties. Recombinant AHB1 protein, 
expressed in E. coli> was purified, and the kinetics of 0 2 and 
CO binding were measured. AHB1 shows a remarkably high 
0 2 affinity, with a P 5 o value equal to 1.6 nM at pH 7, 20°C 
(Table 1). Similar oxygen binding parameters have been 
observed for the nonsymbiotic hemoglobins of soybean and 
rice (R.A.W., unpublished data, and M. Hargrove, personal 
communication). Lowering the pH to 5 raises the P 50 to 6.7 
nM (Table 1), but this value is still smaller than that of any 
known plant cytochrome oxidase (17), making it unlikely 
that AHB1 could transport oxygen to these terminal oxi- 
dases. AHB1 also has a low 0 2 dissociation rate constant 
compared with that of soybean leghemoglobin A, a known 
oxygen transporter (Table 1). Thus, AHB1 would appear to 
be an oxygen-scavenging protein rather than an oxygen 
transporter. 

In contrast, recombinant AHB2 has a much lower affinity 
for 0 2 , with P 50 values of 130 nM at pH 7 and 2,700 nM at pH 
5 (Table 1). The dissociation rate constant is low at pH 7 but 
increases 20-fold at pH 5, approaching the value for soybean 
leghemoglobin A under the same acidic conditions. The low 
association rate constant for 0 2 binding to AHB2 appears to 
be caused by the reversible formation of a hexacoordinate 
complex (hemochrome) in deoxygenated AHB2. The imida- 
zole side chain of the distal histidine (E7), the only polar 
residue in the distal pocket, may coordinate directly to the iron 
atom of the heme group, thereby inhibiting rapid 0 2 binding 
and facilitating 0 2 dissociation. 



Table 1. Comparison of parameters* for 0 2 binding to 
recombinant hemoglobins at 20°C 



Protein 


pH 


Pso, nM 


ko 2 , s" 1 


M K co /Ko 2 


AHB1 


7.0 


1.6 


0.12 


.2.0 




5.0 


6.7 


0.38 




AHB2 


7.0 


130 


0.14 


26 




5.0 


2700 


2.1 




Soy IbAt 


7.0 


44 


5.6 


87 


5.0 


20 


2.7 





* Where ko 2 and k'o 2 are the bimolecular dissociation and association 
rate constants for O2 binding, respectively. The dissolved 0 2 con- 
centration at which hemoglobin is half saturated (P 50 ) is equivalent 
to the dissociation equilibrium constant for O2 (K'o 2 = ko 2 /k'o 2 ). 
Ko 2 and Kco are the association equilibrium constants for O2 and 
CO, respectively. M is the partition coefficient expressing the relative 
affinities for 0 2 and CO. 

tRecombinant soybean leghemoglobin A. 



DISCUSSION 

AHB1 is related to the nonsymbiotic hemoglobin genes present 
in other plants. The identification of this gene in Arabidopsis 
supports the idea that nonsymbiotic hemoglobin genes may be 
present in all plants (6). The expression pattern otAHBl is 
similar to that of other nonsymbiotic hemoglobins; low levels 
of expression in root tissue are common to all nonsymbiotic 
hemoglobins (3-8), sucrose inducibility has been observed for the 
Casuarina nonsymbiotic hemoglobin (4), and the hypoxic induc- 
tion of AHB1 resembles that of the barley hemoglobin (8). 

AHB2 represents a new class of nonsymbiotic hemoglobin 
gene. It shows limited homology to the other nonsymbiotic 
hemoglobins and has a different pattern of gene expression. 
AHB2-\\)az genes may be present in a wide range of plants. In 
support of this theory, we have cloned an AHB2 homologue 
from Brassica napus (R.A.W., unpublished data). We suggest 
that the AHBl-Mkc "nonsymbiotic hemoglobin" gene family be 
referred to as class 1 plant hemoglobin genes and that AHB2- 
like genes be referred to as class 2 plant hemoglobin genes. 

It has been suggested that symbiotic hemoglobin genes of 
legumes and Casuarina arose from gene duplication of a 
nonsymbiotic (class 1) hemoglobin gene in the ancestor of 
nitrogen-fixing plants (5, 6). The similarity between^//B2 and 
symbiotic hemoglobins suggests an alternative explanation; a 
class 2 hemoglobin gene may have been the direct ancestor of 
the symbiotic hemoglobin genes. In contrast, it appears that a 
class 1 hemoglobin has been recruited into symbiotic function 
in Parasponia nodules. The recruitment of either class 1 or class 
2 plant hemoglobins into symbiotic roles seems to have been 
associated with changes in gene expression, leading to high 
levels of expression in nodules, and with alterations in the 
oxygen binding properties of the protein. 

Leghemoglobins facilitate oxygen flux down an oxygen 
gradient without the involvement of other proteins. It is 
unlikely that either AHB1 or AHB2 acts as an oxygen trans- 
porter in this manner. The oxygen affinity of AHB1 is too high 
for release of oxygen to cytochrome oxidase, and the oxygen 
dissociation rates of both proteins are too slow for efficient 
oxygen transport. Instead, these proteins may act as oxygen 
scavengers, binding oxygen until interactions with other pro- 
teins occur. The eventual destination for bound oxygen could 
be mitochondrial metabolism or oxygen-consuming enzymatic 
reactions. It is also possible that AHB1 or AHB2 have un- 
identified catalytic properties. Analysis of transgenic plants 
that over- or underexpress AHB1 or AHB2, combined with 
cellular localization of these proteins, should lead to a better 
understanding of the functions of these and other plant 
hemoglobins. 

We thank Janice Norman for her excellent technical assistance and 
Cyril Appleby for his helpful discussions. 

1. Kubo, H. (1939) in Appleby, C. A. (1992) ScL Prog. 76, 365-398. 

2. Appleby, C. A. (1984) Annu. Rev. Plant Physiol 35, 443-478. 

3. Fleming, A. I., Wittenberg, J. B., Wittenberg, B. A., Dudman, 
W. F. & Appleby, C. A. (1987) Biochim. Biophys. Acta 911, 
209-220. 

4. Jacobsen-Lyon, K., Jensen, E. 0., Jorgensen, J., Marcker, K. A., 
Peacock, W. J. & Dennis, E. S. (1995) Plant Cell 7, 213-223. 

5. Landsmann, J., Dennis, E. S., Higgins, T. J., Appleby, C. A., 
Kortt, A. A. & Peacock, W. J. (1986) Nature (London) 324, 
166-168. 

6. Bogusz, D., Appleby, C. A., Landsmann, J., Dennis, E. S. & 
Peacock, W. J. (1988) Nature (London) 331, 178-180. 

7. Andersson, C. R., Jensen, E. 0., Llewellyn, D. J., Dennis, E. S. & 
Peacock, W. J. (1996) Proc Natl. Acad. Sci. USA 93, 5682-5687. 

8. E. R. Taylor, E. R., Nie, X. Z., MacGregor, A. W. & Hill, R. D. 
(1994) Plant Mol Biol 24, 853-862. 

9. Dolferus, R., Jacobs, M., Peacock, W. J. & Dennis, E. S. (1994) 
Plant Physiol. 105, 1075-1087. 



12234 Plant Biology: Trevaskis et al. 



Proc. Natl. Acad, Sci. USA 94 (1997) 



10. Logemann, J., Schell, J. & Willmitzer, L. (1987) Anal. Biochem. 
163, 16-20. 

11. Ausubel, F. M., Brenton, R., Kingston, R. E., Moore, D. D., 
Siedman, J. G., Smyth, J. A. & Struhl, K. (1994) Current Protocols 
in Molecular Biology (Wiley, New York). 

12. Taylor, A. & Powell, B. (1982) BRL Focus 4, 4. 

13. Springer, B. A. & Sligar, S. G. (1997) Proc. Natl. Acad. Sci USA 
84, 8961-8965. 

14. Hargrove, M. S., Barry, J. K., Bruckner, E. A., Berry, M. B., 
Phillips G. N., Jr., Olson. J. S., Arrendo-Peter, R., Dean, 



J. M., Klucas & R. V. Sarath: G. (1997) /. Mol Biol 266, 
1032-1042. 

15. Chang, C. & Meyerowitz, E. M. (1986) Proc. Natl. Acad. Sci. USA 
83, 1408-1412. 

16. Bashford, D., Chothia, C. & Lesk A. M. (1987) J. Mol. Biol 196, 
199-216. 

17. Millar, A. H., Bergerson, F. J. & Day, D. A. (1994) Plant Physiol. 
Biochem. 32, 847-852. 

18. Wittenberg, J. B. & Wittenberg, B. A (1990) Annu. Rev. Biophys. 
Biophys. Chem. 19, 217-241. 



Proc. Nad. Acad. Sci. USA 

Vol. 93, pp. 5682-5687, June 1996 

Plant Biology 

A new hemoglobin gene from soybean: A role for hemoglobin in 
all plants 

(nonsymbiotic/leghemoglobin/evolution) 

Carol R. ANDERSsoN*tt, Erik Ostergaard Jensen§, Danny J. Llewellyn*, Elizabeth S. Dennis*, 
and W. James Peacock*^ 

♦Commonwealth Scientific and Industrial Research Organization Plant Industry, G.P.O. Box 1600, Canberra, ACT 2601, Australia; ^Division of Biochemistry and 
Molecular Biology, Australian National University, G.P.O. Box 4, Canberra ACT 2601, Australia; and ^Laboratory of Gene Expression, University of Aarhus, 
Gustav Weids Vej 10, Aarhus C 8000 Denmark 

Contributed by W. James Peacock, January 24, 1996 



ABSTRACT We have isolated a new hemoglobin gene 
from soybean. It is expressed in cotyledons, stems of seedlings, 
roots, young leaves, and in some cells in the nodules that are 
associated with the nitrogen-fixing Bradyrhizobium symbiont. 
This contrasts with the expression of the leghemoglobins, 
which are active only in the infected cells of the nodules. The 
deduced protein sequence of the new gene shows only 58% 
similarity to one of the soybean leghemoglobins, but 85-87% 
similarity to hemoglobins from the n on legumes Parasponia, 
Casuarina, and barley. The pattern of expression and the gene 
sequence indicate that this new gene is a nonsymbiotic legume 
hemoglobin. The finding of this gene in legumes and similar 
genes in other species strengthens our previous suggestion 
that genomes of all plants contain hemoglobin genes. The 
specialized leghemoglobin gene family may have arisen from 
a preexisting nonsymbiotic hemoglobin by gene duplication. 



Despite the common perception of hemoglobin as a blood 
protein, the protein is also found in many invertebrates, 
bacteria, fungi, and in some higher plants. Plant hemoglobin 
was thought to be restricted to the nitrogen-fixing root nodules 
of legumes in the highly specialized symbiosis with strains of 
Rhizobium or Bradyrhizobium bacteria. The function of these 
legume hemoglobins (leghemoglobins) is the facilitated diffu- 
sion of oxygen to the respiring bacteroids within the root 
nodule (for review, see ref. 1). 

More recently, hemoglobin and hemoglobin genes have 
been discovered in the non legume genera Parasponia, Trema, 
Casuarina, Hordeum, Triticum, and Zea (2-8). The finding of 
hemoglobin genes in non-nodulating plants (Trema, Hordeum, 
Triticum, and Zea) and in phylogenetically diverse plant gen- 
era, including monocots, reinforces the proposal that hemo- 
globin may be present in all plants (9). Two roles for plant 
hemoglobin outside the nitrogen-f being symbiosis have been 
proposed: (i) as a facilitator of oxygen diffusion in rapidly 
respiring cells or (ii) as an oxygen sensor involved in switching 
plant metabolism to anaerobic pathways (9). 

In general terms, there appear to be two different types of 
plant hemoglobins, the symbiotic (nodule specific and associ- 
ated with a symbiotic interaction involving microorganisms) 
and the nonsymbiotic hemoglobins (expressed in non-nodule 
tissues such as roots and stems). The situation in the nodulating 
tree Parasponia andersonii is unusual in that a single gene 
encodes a protein that has both a symbiotic and nonsymbiotic 
role (3). 

The symbiotic hemoglobin genes have been postulated to 
have arisen by duplication of preexisting, nonsymbiotic, he- 
moglobin genes (9). Casuarina glauca presents a possible 
example of such a gene duplication and divergence process in 
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that it has both a single nonsymbiotic hemoglobin gene ex- 
pressed in all tissues analyzed, and a different, symbiotic 
hemoglobin gene family expressed exclusively in the nodules 
induced by Frankia, an actinomycete (6, 7). Gene duplication 
with subsequent specialization is a common evolutionary 
strategy and recent evidence suggests that in legumes other 
nodule specific proteins have arisen by this mechanism, e.g., 
nodulin 26 (10). 

If symbiotic hemoglobin genes were derived by duplication 
from nonsymbiotic genes, then where are the nonsymbiotic 
hemoglobins in legume species? Legumes have well-char- 
acterized, nodule-specific leghemoglobin gene families, but 
apart from a single report of a hemoglobin-like protein in the 
seeds of winged bean (11), expression of hemoglobin genes 
outside nodules has never been detected by either nucleic acid 
hybridization or reporter gene assays. 

In this paper, we report the isolation and characterization of 
a nonsymbiotic hemoglobin gene from soybean and confirm 
that similar genes are present in other legumes. The gene is 
clearly related to nonsymbiotic plant hemoglobins on the basis 
of its predicted amino acid sequence and gene structure, 
including the promoter sequence; its expression pattern within 
the plant is unlike that of the symbiotic leghemoglobins. 

MATERIALS AND METHODS 

Isolation of Genomic DNA and Southern Blot Analysis. 

Isolation of genomic DNA and Southern blot analysis of 
hemoglobin genes from Glycine max cv Lincoln, Pisum sativum 
cv Greenfeast, Pisum humile, Medicago sativa cv R15 and 
Trifolium repens cv Haifa was as described (3). Probes were 
either the PCR fragment soyhblf+4r2 or the leghemoglobin 
clones 13.0/1.6 and 7.5/2.0 (12), which together cover the 
entire coding region of soybean Ibpsl. 

PCR Reactions. Degenerate primers were designed to con- 
served regions of plant nonsymbiotic hemoglobins (see Fig. 1) 
as follows: Hbexonlf (hemoglobin exon 1 forward primer), 
5'-CGGAAIICGA(A/G)GA(A/G)(C/G)A(A/G)GA(A/ 
G)GCI(T/C)TIGT; Hbexon2f (exon2 forward primer), 
5 / - CGGAATT CATITT(T/C)GA(A/G)ATIGCICC; Hbexon2r 
(exon 2 reverse primer), 5'-CGGGATCCGC(A/G)TGIII(T/ 
C)TTIA(A/G)(T/C)TTIGG(A/G)TT; Hbexon4r2 (exon 4 
reverse primer), 5'- CGGGATCC GC(T/OTC{T/OTTIAT- 
IGT(T/C)TC. 

Template DNA was digested with Hin&lU oxXhol, followed 
by heat inactivation at 65°C for 20 min. The PCR reaction 
included Taq DNA polymerase buffer (Promega)/! mM 
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MgCl 2 /200 /xM each dNTP/1 ju,M each primer/250 ng di- 
gested genomic DNA/2.5 units Taq DNA polymerase (Pro- 
mega) in a total reaction volume of 15 ftl. Reactions were 
performed using a Corbett thermal cycler. The reaction in- 
corporated stepwise reductions in annealing temperature 
("touchdown PCR") from 50°C to 37°C, and slow ramp times 
from the annealing to the extension temperature. The dena- 
turation times were 30 sec, with annealing and extension times 
of 1 min. PCR products were digested with BamH\ and£coRI, 
purified from a polyacrylamide gel, and cloned into pBlue- 
script SK (— ) (Stratagene) or pGem 3Zf(-) (Promega). 
Plasmid inserts were sequenced using a Pharmacia T7 se- 
quencing kit. 

The end-labeled hemoglobin specific primer soyhb2f35 (soy- 
bean hemoglobin exon 2 forward 35 mer) 5'- CTCATTCTT- 
GAGAGATTCAACGGTTCCTTTGGAGC was used to 
identify longer products in subsequent PCR reactions. 

Isolation of Soybean Genomic Clones. A soybean genomic 
library in lambda EMBL4 was constructed from size fraction- 



ated, partial Sau3A digested genomic DNA and screened with 
a soyhblf+4r2 probe using standard methods (13). 

Northern Blot Analysis. Soybean tissue from shoots, leaves, 
stems, flowers (all stages), seed pods, roots, nodules, and 
germinating seeds were harvested, snap frozen in liquid ni- 
trogen, and stored at -80°C. To eliminate the possibility that 
root tissue contained nodules, roots were harvested from 
seedlings grown from surface sterilized seeds sown in sterilized 
vermiculite/sand. Total RNA was isolated and analyzed by 
Northern blotting as described (3). Filters were probed with an 
antisense RNA probe (Amersham in vitro transcription kit) 
synthesized from soyhblf+4r2, subsequently stripped and 
reprobed with an A. thaliana ubiquitin (14) antisense RNA 
probe. The hybridization signals were quantified using a 
Phosphorlmager (Fujix) and the abundance of the soybean 
hemoglobin message expressed relative to ubiquitin. The size 
of transcripts were estimated relative to RNA standards 
(BRL). 
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l hbexQnlf> 

SSS EVDKVFTEEQ EALWKSWAV 

SSS EVNKVFTEEQ EALWKAWAV 

ST LEGRGFTEEQ EALWKSWSA 

TT TLERGFSEEQ EALWKSWNV 

ALVEDNNAV. . . AVSFSEEQ EALVLKSWAI 
ALVEGNNGVS GGAVSFSEEQ EALVLKSWAI 

SAA EGAWFSEEK EALVLKSWAI 

GAFTEKQ EALVSSSFEA 

. : GAFTDKQ EALVSSSFEA 

VAFTEKQ DALVSSSFEA 

GAFTEKQ EALVSSSFEA 

GAFTEKQ EALVNSSWEA 

GFTDKQ EALVNASYEA 

SFTDKQ EALVNSSYEA 

GFTEQQ EALVNSSSQL 

GFTDKQ EALVNSSSE. 

GVLTDVQ VALVKSSFEE 

ALTEKQ EALLKQSWEV 



61 <hbexon2r 

KDSPRPLEQN PKLKPHAMTV 

KDS PVPLEQN PKLKPHATTV 

KDSNVPLERN PKLKSHAMSV 

RDSTVPLEQN PKLKPHAVSV 

RNSDVPLEKN PKLKTHAMSV 

RNSDVPLEKN PKLKTHGMSV 

RDSDVPLETN PKLKTHAVSV 

, . SNGVDPSN PKLTGHAEKL 

. . ANGVDPTN PKLTGHAEKL 

. . ANGVDPTN PKLTGHAEKL 

. . ANGVDPTN PKLTGHAEKL 

. . ANGVDPTN PKLTAHAESL 

KDSDGVPQNN PSLQAHAEKV 

KDSAGV.QDS PQLQAHAEKV 

KDSAGW.DS PKLGAHAEKV 

KDTAGV . EDS PKLQAHAEQV 

KGSSEVPQNN PDLQAHAGKV 

KDSNEIPENN PKLKAHAAVI 

121 < hbexon 4r2 
FEVTRFALLE TIKEAVP.EM 
FEVTRFALLE TIKEAVP.EM 
FEVTKFALLE TIKEAVP. ET 
FEVTKFALLE TIKEAVP.EM 
FEWKFALLD TIKEEVPADM 



FVMTCESAVQ 
FVMTCESAVQ 
FLMTCESAVQ 
FVMTCDSAVQ 
FVMTCEAAAQ 
FVMTCEAAAX 
FVMTCEAAAQ 
FGLVRDSAGQ 
FGLVRDSAGQ 
FALVRDSAGQ 
FALVRDSAGQ 
FGLVRDSAAQ 
FGLVHDAAGQ 
FGLVRDSASQ 
FGMVRDSAVQ 
FGLVRDSAAQ 
FKLTYEAAIQ 
FKTICESATE 



WSPEMKNAWG 
WSPEMKNAWG 
WSPEMKNAWG 
WSPAMKNAWG 
WSPAMKSAWS 



MKKNSAELGL 
MKKNSAELGL 
MKPNAGELGL 
MKKNSGELGL 
LKKDSANIAL 
MKKDSANIGL 
MKKDSANLGL 
FKANIPQYSV 
FKTNI PQYSV 
FKANIPQYSV 
FKANIPQYSV 
FKGNI PQYSV 
FKKNLPGHSV 
FKQNLSGYSV 
FKQNPSNYSV 
FKQNLQGYAT 
FNANIPKNTH 
LKQNIPAHSL 



LRKAGKVTVR 
LRKAGKVTVK 
LRKAGKVTVR 
LRKAGKVTVR 
CGKAGKVTVR 
CGKAGKVTVR 
LRKAGKITVR 
LKANGTW . A 
LKASGTW . I 
LKASGTW . A 
LKTNGTW . A 
LRANGAW . A 
LRATGWVLA 
LRATGGWLG 
LRATGEWL . 
LRTKGEWLG 
LQVNGAVASD 
LRQKGHAVWD 



EAYDQLVAAI 
VAYDQLVAAI 
EAYDKLVAAI 
EAYDQLVDAI 
EAYDHLVAAI 



* hbexon2 f > 60 
KFFLKIFEIA PSAKNLPSYL 
QFFLKIFEIA PSAKNLPSYL 
KFFLKIFEIA PSAQKLFSFL 
KFFLKIFEIA PSAQKLFSFL 
RFFLKIFEVA PSASQMFSFL 
RFFLKIFEVA PSGCQMPSFL 
RFFLKIFEIA PSARQMPPFL 
VFYTS I LEKA PAAKDLFSFL 
VFYTSILEKA PVAKDLFSFL 
VFYTS I LEKA PAAKDLFSFL 
VFYNSILEKA PAAKDLFSFL 
VFYTSILEKA PAAKNLFSFL 
LFYSFILEKE PAAKGLFSFL 
FFYTVILEKA PAAKGLFSFL 
LFYTIILQKA PTAKAMFSFL 
LFYTIILEKA PAAKGLFSFL 
RFFTLVLEIA PGAKDLFSFL 
RLFALILEAA PESKYVFSFL 



ESNLKRLGAI 
ESDLKRIGAI 
ESSLKKLGAS 
ESNLKKLGAT 
DTTLKRLGAT 
DTTLKRLGAT 
ETTLKRLGGT 
.DA. . ALGSI 
.DA. .ALGSI 
. DA . . ALGSV 
.DA. .ALVSI 
.DA. .ALGSI 
. DA . . SLGSV 
.DA. .ALGA I 
. DG . . KDGSI 
.NA. . TLGAI 
. ATLKSLGSV 
NNTLKRLGSI 

169 

KSEMKPSST 
KFEMKPSST 
KLEMKPSS. 
KSEMKPPSS 
KQEMKPAE . 



120 

HFKNGWNEH 
HFKTGWNEH 
HFKHGVADEH 
HFRTGVANEH 
XLKYGVGXPX 

HLKYGVADGH 
HAQKAITDPQ 
HAQKAITDPQ 
HAQKAVTDPQ 
HAQKAVTDPQ 
HSQKGVNDSQ 
HVQKGVTDPH 
HIQKGWDPH 
HIQKGVLDPH 
HAGKGVTNPH 
HVSKGWDAH 
HLKNKITDPH 



FEVTRFALLE 
FVWKEALLK 
FVWKEALLK 
FVWKEALLK 
FVWKEALLK 
FLWKEALLK 
FVWKEALLK 
FVWKEALLK 
FVWKEALLK 
FWVKEALLQ 
FPWKEAILK 
FEVMKGALLG 



TIKEALPADM 
TIKEAVG.DK 
TIKEAVG.DK 
TIKAAVG.DK 
TIKEAVG.GN 
TLKEAVG . DK 
TLKEAAG . AT 
TIKEAAG.DK 
TIKEASG.DK 
TIKKASG.NN 
TIKEWG.DK 
TIKEAIK. EN 



WGPEMRNAWG 
WSDELSSAWE 
WSDELSSAWE 
WSDELSRAWE 
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Fig. 1. Alignment of the predicted amino acid sequences of various plant hemoglobins using the GCG pileup program. The highly conserved 
residues involved in heme and ligand binding [the distal and proximal histidines, phenyalanine CD1 and proline C2 (1)] are in bold. Intron positions 
are marked with asterisks. Conserved regions used to design degenerate PCR primers are overlined. Ricehbl and ricehb2, expressed sequence tag 
sequences from rice, GenBank accession numbers D15507and D23324 or D25122 and D22678, respectively (X is indicated at positions of ambiguity 
in the entered DNA sequence); barleyhb (8); Trema tomentosa hemoglobin (tremahb) (4); Parasponia andersonii hemoglobin (parahb) (3), Casuarina 
glauca nonsymbiotic hemoglobin (casnonsymhb) (6); soybean nonsymbiotic hemoglobin (soyhb) (this study), lupin hemoglobin 1 (lupinllb) (15); 
soybean lba and lbcl leghemoglobins (soylba and soylbcl) (16); soybean lbc2 and lbc3 leghemoglobins (soylbc2 and soylbc3) (17); kidney bean 
leghemoglobin (kidneybean) (18); Sesbania rostrata leghemoglobin II (sesbaniaii) (19); Medicago truncatula leghemoglobinl (medicagol) (20); 
broadbean leghemoglobin (broadbean) (21); pea leghemoglobin (pealb) (22), Casuarina glauca symbiotic hemoglobin (cassymhb) (7). 
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Fig. 2. Southern blot analysis of the soybean nonsymbiotic hemo- 
globin gene and leghemoglobin genes. Glycine max cv Lincoln genomic 
DNA was digested with EcoR I (lane 1), BamHl (lane 2),//mdIII, (lane 
3) and Bglll (lane 4). (A) Soybean nonsymbiotic hemoglobin (soybean 
lf+4r2 PCR fragment probe). (B) Soybean leghemoglobins (Ibpsl 
13.0/1.6 and 7.5/2.0 probes). Lambda Hindlll size standards are in kb. 

RESULTS 

A New Hemoglobin Gene in the Soybean Genome. Short 
degenerate oligonucleotides designed from the conserved 
regions of known nonsymbiotic plant hemoglobins (Fig. 1) 
were used in a PCR reaction on soybean genomic DNA. 
Primers within the second exon (hbexon2f and hbexon2r) 
amplified a number of fragments including one of « 1 10 bp, the 
size predicted from other plant hemoglobins. The 110-bp 
fragment had an open reading frame that contained residues 
known to be critical in hemoglobin structure and function, and 
was partially homologous to the known nonsymbiotic plant 
hemoglobins. A longer gene fragment was generated using de- 
generate primers derived from sequences in the first and last 
exons of nonsymbiotic hemoglobins (hbexonlf and hbexon4r2). 
As the genomic DNA was likely to include three introns, the 
hemoglobin-specific Soyhb2f35 probe was used to identify the 
correct product from this second PCR reaction. 

The longer clone (lf+4r2) contained the original 110-bp 
fragment and was used to isolate corresponding genomic 
segments. The gene sequence (GenBank accession no. TJ47143) 
had a five amino acid N-terminal extension similar to that in the 
nonsymbiotic and P. andersonii hemoglobins (Fig. 1), and three 
introns at positions found in all plant hemoglobins. 

The New Hemoglobin Gene Does not Hybridize to the 
Leghemoglobins; Homologues Are Present in Other Legumes. 
Southern blot analysis, using the If +4r2 PCR fragment and the 



previously described leghemoglobin gene probes, showed no 
hybridizing bands in common between these two classes of 
genes in soybean (Fig. 2) as expected from the low similarity 
between their nucleotide sequences (Table 1). Using the 
lf+4r2 probe, EcoR I -digested soybean DNA displayed two 
hybridizing bands of about equal intensity, whereas the BamHl 
and Bglll lanes each had a single, more intense band. The 
Hindlll digest produced two bands of different intensities. As 
the soybean lf+4r2 probe does not contain an EcoRl site, we 
conclude that the soybean genome contains two copies of the 
new hemoglobin gene and, because there is a single BamHl 
fragment, that these might be physically linked. Alternatively, 
and perhaps more likely, because soybean is an ancient tet- 
raploid, the two copies may be present on homeologous 
chromosomes, giving rise to separate restriction fragments of 
equivalent length. It is unlikely that the two bands in the £a?RI 
and Hindlll digests represent restriction fragment length 
polymorphisms of a single gene because of the inbred nature 
of the soybean cultivar. 

Southern blot analysis of other legumes using the soybean 
lf+4r2 probe at moderate hybridization stringency detected 
one or more cross-hybridizing bands in white clover, lucerne, 
and pea (data not shown), indicating that an homologous 
nonsymbiotic hemoglobin gene(s) is also present in the ge- 
nomes of these species. 

The Promoter of the New Hemoglobin Gene Does Not 
Contain the Nodulin Consensus Sequences. The symbiotic 
hemoglobin promoters contain two motifs that have been 
shown to be critical for nodule specific expression (the nodulin 
boxes, CTCTT and AAGAT) separated by six or seven nu- 
cleotides (23, 24). The nonsymbiotic (or dual function hemo- 
globin, in the case of the P. andersonii gene) hemoglobin genes 
from C glauca, Trema tomentosa, and P. andersonii do not have 
these conserved motifs at this spacing (see Fig. 5). The critical 
CTCTT motif is absent from the nonsymbiotic genes. The 
promoter of the new soybean hemoglobin gene does not 
contain this motif either, but has the sequence CTCCC, 
identical in sequence and position to a motif in P. andersonii 
(see Fig. 5). Another motif shown to be critical for expression 
of the P. andersonii promoter (unpublished data), GAAGAG, 
is present in a similar position as GAAGGG is in the soybean 
promoter. These data suggest that the new soybean hemoglo- 
bin has a promoter that resembles promoters of the nonsym- 
biotic hemoglobins of other plant genera, more than it resem- 
bles the promoters of the symbiotic leghemoglobins in its own 
genome. 

The New Soybean Hemoglobin Gene Is Expressed in Many 
Tissues. Northern blot analysis of total RNA isolated from 
various soybean tissues detected a nonsymbiotic hemoglobin 
mRNA of 830 nt. The message was readily detected in stems 
of mature plants and cotyledons of seedlings (Fig. 3). The gene 
was also expressed in roots, nodules, and young leaves of 
mature plants, but was virtually absent from older leaves, floral 
tissue, and seed pods. The level of expression in nodules was 
comparable to that in non-nodulated roots and considerably 
lower than that of leghemoglobin genes in the infected cells of 
nodules. The new hemoglobin's transcript was present at a 



Table 1. Overall sequence similarity (%) between various plant hemoglobins 
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Numbers above the diagonal represent similarity between predicted hemoglobin amino acid sequences; numbers below the 
diagonal represent similarity between nucleotide sequences in the coding region (see Fig. 1 legend). 
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Fig. 3. Nonsymbiotic hemoglobin gene expression patterns in soybean seedlings and plants. Total RNA was isolated from soybean shoot tips, 
young leaves, old leaves, flower buds, whole seed pods, stems, roots, nodules, root tips, and cotyledons, roots, and shoots from germinating seeds. 
(A) Hybridization pattern with a riboprobe corresponding to the soybean PCR fragment lf+4r2. (B) Re hybridization with an Arabidopsis ubiquitin 
probe provided levels of nonsymbiotic hemoglobin gene transcript relative to ubiquitin abundance. 



lower concentration in root tips than in the remainder of the 
root (Fig. 3). 
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Fig. 4. Plant hemoglobin protein similarity tree. The similarity tree 
was constructed using the GCG pileup program. Sequences used are 
referenced in Fig. 1. 



The Northern blot data are supported by in situ hybridiza- 
tion analyses that show that the soybean nonsymbiotic hemo- 
globin is expressed in both nodules and shoot tips. In nodules, 
the new hemoglobin gene is expressed in many cell types 
whereas leghemoglobins are present only in the bacteroid- 
containing cells (25). 

DISCUSSION 

The New Hemoglobin Gene Is Present in Soybean and Other 
Legumes and Encodes a "Nonsymbiotic" Protein. We have 
isolated a novel, expressed hemoglobin gene from soybean that 
is not a member of the leghemoglobin gene family. The gene 
has the characteristic four exon, three intron structure of all 
known plant hemoglobins. The predicted amino acid sequence 
is only 57-58% identical to leghemoglobin sequences (Table 
1), but contains the critical heme and ligand binding residues 
of a functional hemoglobin. It resembles other nonsymbiotic 
hemoglobins in having an extension of five amino acids at the 
amino terminus (Fig. 1, Table 1). This new soybean gene is 
most abundantly expressed in stems and cotyledons of seed- 
lings, and in roots, some nodule tissue, and young leaves of 
mature plants. Expression of the new soybean hemoglobin 
therefore differs considerably from that of the nodule-specific 
leghemoglobins and resembles that of the nonsymbiotic gene 
of C. glauca (7). Both the gene sequence and expression 
pattern indicate that we have isolated a nonsymbiotic soybean 
hemoglobin gene. Cross-hybridization of this soybean gene 
with sequences in the genomes of other legumes suggests that 
all legumes contain a similar nonsymbiotic hemoglobin in 
addition to their well-characterized leghemoglobin genes. 

Evolution of Plant Hemoglobins. The isolation of this gene 
from soybean adds another important example to a growing 
list of plant hemoglobins whose expression is not associated 
with any symbiotic interactions with micro-organisms. The 
GenBank EMBL data base contains the 5' sequences of two 
classes of rice expressed sequence tags with extensive homol- 
ogy to the nonsymbiotic hemoglobins (Fig. 1). The presence of 
expressed nonsymbiotic hemoglobin genes in a number of 
dicots, now including the legumes, as well as the monocots 
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- 1 3 3 AAATTTTTTAAAAGATCG -TTGTTTCTTCTTCATCATGCTGATTGA CACCCTCC ACAAGCCAAGAGAAAC - 6 5 

- 1 2 8 AGGATTTTGAAAAGATCA-TTGGCTCTTCGTCATGCCG ATTGA CACCCTCC ACAAGCCAAGAGAAAC - 6 3 

- 1 3 9 AAGATTTTGAAAAGATC ATTTGGCTCTTCATCATGCCG- - - ATTGA CACCCTCC ACAAGCCAAGAGAAAC - 6 3 

- 4 9 0 ACTTC AATCCCAAGATGTCCTCTCTCTTATTGATATTTG AACAACAAC AAAGATAAACAACCATTATCCCTACCAAGCAAGTAA - 4 0 8 
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Fig. 5. Sequence alignment of the promoter regions of hemoglobin genes from soybean, Parasponia, Trema, and Casuarina. Gaps, indicated 
by dashes, have been introduced to maximize similarity. The nodulin motifs, 5'-AAAGAT and S'-CTCTT, are indicated in bold type, as is the 
putative nonsymbiotic motif 5'-GAAGAG. The TATA box is underlined. Casuarina glauca symbiotic hemoglobin 1 (7) and C. glauca hb 
nonsymbiotic hemoglobin (6) and soybean nonsymbiotic hemoglobin (this study) are numbered from the start codon, Parasponia andersonii 
hemoglobin (3), Trema tomentosa hemoglobin (4), Soybean iba t Ibcl, lbc2, lbc3 (27) are numbered from the transcription start. The C. glauca 
symbiotic hemoglobin promoter contains a direct repeat in this region and both copies are included. 



barley, wheat, maize (8), and rice, shows that nonsymbiotic 
hemoglobin is widespread and possibly ubiquitous in the plant 
kingdom. It is likely that the more specialized symbiotic 
hemoglobins arose by gene duplication of a preexisting "non- 
symbiotic" gene. The presence in the ancient tetraploid soy- 
bean of two gene clusters of leghemoglobins (26), and now of 
two nonsymbiotic hemoglobin genes (Fig. 2), is consistent with 
an origin for the leghemoglobins by a gene duplication event 
occurring before the hybridization of the diploid progenitors of 
G. max. 

A similarity-tree based on pairwise comparisons of many of 
the known plant hemoglobin amino acid sequences was con- 
structed using the Genetics Computer Group (GCG; Madison, 
WI) pileup program (Fig. 4). All the nonsymbiotic hemoglo- 
bins cluster together, including the newly characterized gene 
from soybean. Within the nonsymbiotic hemoglobins, the dicot 
and monocot sequences form separate subclusters. All of the 
leghemoglobin sequences cluster together, with those from 
plants with determinate, indeterminate and lupin-type nodules 
forming separate subgroups, consistent with the analysis of 
Marcker and Sandal (26). The symbiotic hemoglobins of one 
species are more similar to symbiotic hemoglobins of other 
species than to the nonsymbiotic hemoglobins within the same 
species. This analysis suggests that leghemoglobins arose be- 
fore speciation within the legumes, the result of an initial gene 
duplication event that was followed by further duplications 
giving rise to the symbiotic leghemoglobin gene families within 
the present day legumes. 

The C. glauca symbiotic hemoglobin (cassymhb) clusters 
with the leghemoglobins rather than with the nonsymbiotic 
hemoglobins. The C. glauca nonsymbiotic hemoglobin protein 
is 87% similar to the soybean nonsymbiotic hemoglobin and 
groups with it on a separate subbranch of the dendrogram (Fig. 
4). The promoter motifs of these two nonsymbiotic genes are 
also similar (Fig. 5). Because the symbiotic hemoglobins 
(cassymhb and the leghemoglobins) are more similar to each 
other than to their putative progenitor nonsymbiotic hemo- 
globins in sequence, expression and regulation, they may have 
been derived from the same initial gene duplication before the 
divergence of Casuarina and the legumes. The similarities of 
the symbiotic and nonsymbiotic hemoglobins from Casuarina 
and soybean suggests that the species are more closely related 
than has previously been thought (28). A study based on the 
DNA sequences of the large subunit of the Rubisco gene has 
also placed the legumes close to Casuarina (29). 

Because of the animal/plant cross-kingdom occurrence of 
hemoglobin, the conserved functional amino acid residues in 
the proteins and the conserved positions of the introns within 
the genes, hemoglobins must be very ancient. They presumably 
predate the divergence of plants and animals and existed well 
before the diversification of the Angiosperms. We predict that 



hemoglobin genes will also be present in the genomes of more 
primitive plant groups, such as bryophytes, pteridophytes, 
cycads, and gymnosperms. 

Possible Functions of Nonsymbiotic Plant Hemoglobin. Two 
possible functions of nonsymbiotic plant hemoglobins have 
been proposed, as a sensor of oxygen concentration, or as a 
carrier in oxygen transport (9). Some hemoglobins are known 
to regulate associated enzymatic activities by changes in their 
conformation due to substrate binding (30, 31). Both the C. 
glauca and soybean nonsymbiotic hemoglobin genes have 
moderate levels of expression, and the proteins may therefore 
be more abundant than is expected for an oxygen sensor. The 
induction in barley of hemoglobin expression by low oxygen 
tension (8) suggests that hemoglobin may be a normal com- 
ponent of the anaerobic response in plants, presumably to 
facilitate oxygen diffusion at low oxygen concentrations and 
supports a role for nonsymbiotic hemoglobin in oxygen transport. 

Nonsymbiotic hemoglobin may facilitate oxygen diffusion in 
rapidly dividing cells, such as those in the root meristem. Our 
Northern blot analyses show that expression of the soybean 
nonsymbiotic hemoglobin gene is higher in the root elongation 
zone than in the root tip, suggesting that any requirement for 
oxygen transport is not confined to meristematic cells. The 
high levels of expression in cotyledons and stems also suggests 
that nonsymbiotic hemoglobin gene expression does not cor- 
relate with cell division, but is perhaps associated with high 
levels of metabolic activity. Nonsymbiotic hemoglobin genes 
are all expressed in various metabolically active tissues such as 
developing seeds and roots (4, 8) or in the vascular tissues of 
leaves, stems and seedling cotyledons (ref. 8 and this study). 
These are all sites of intense short distance solute transfer, an 
energy demanding process. It is possible that the nonsymbiotic 
hemoglobin is facilitating intracellular diffusion of oxygen to 
the mitochondria in metabolically active cells in order to meet 
an increased demand for oxidative respiration. 

The widespread distribution and expression of the nonsym- 
biotic hemoglobin genes implies that they play an important 
role(s) in the metabolic biochemistry of all plants, perhaps 
comparable with the oxygen binding roles of hemoglobins in 
animal systems. 

We thank Janice Norman for excellent technical assistance, Fraser 
Bergersen and Inara Licis for providing soybean plants, Robert Hill for 
making information regarding the barley hemoglobin gene available 
before publication, and U. H. Jacobsen for providing unpublished in 
situ hybridization data showing the expression of the soybean non- 
symbiotic hemoglobin gene in a number of cell types in soybean 
nodules. 
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Casuarina glauca has a gene encoding hemoglobin (cashb-nonsym). This gene is expressed in a number of plant tis- 
sues. Casuarina also has a second family of hemoglobin genes (cashb-sym) expressed at a high level in the nodutes 
that Casuarina forms In a nitrogen-fixing symbiosis with the actinomycete Frankia. Both the nonsymbiotic and symbiotic 
genes retained their specific patterns of expression when introduced into the legume Lotus corniculatus. We Interpret 
this finding to mean that the controls of expression of the symbiotic gene in Casuarina must be similar to the controls 
of expression of the leghemogtobfn genes that operate In nodules formed during the interaction between rhteobia and 
legumes. Deletion analyses of the promoters of the Casuarina symbiotic genes delineated a region that contains nodulin 
motifs identified In legumes; this region Is critical for the controlled expression of the Casuarina gene. The finding that 
the nonsymbiotic Casuarina gene is also correctly expressed in L. corniculatus suggests to us that a comparable non- 
symbiotic hemoglobin gene will be found in legume species. 



INTRODUCTION 

Nitrogen-fixing symbioses between plants and bacteria have 
been studied in detail in legumes, but a number of other plant 
families also have symbiotic nitrogen-fixing associations. For 
example, the woody shrub and tree species of Casuarina fix 
nitrogen in association with the actinomycete Frankia. Just as 
in the nodules of legumes, hemoglobin is present in the Casua- 
rina nodules, where it transports oxygen to the symbiont 
(Fleming et al., 1987). 

Kortt et al. (1988) showed that Casuarina nodule hemoglobin 
has 44% amino acid sequence identity with the leghemoglobin 
of soybean nodules. We isolated a hemoglobin gene from 
C. glauca using a Parasponia hemoglobin gene probe 
(Landsmann et al., 1986). Contrary to our expectation, the 
deduced amino acid sequence of the Casuarina gene did not 
correspond to the amino acid sequence of the Casuarina nod- 
ule hemoglobin (Christensen et al., 1991). The gene encoded 
a protein with a predicted amino acid sequence having only 
53% identity with the nodule protein, whereas it was 80% iden- 
tical to Parasponia hemoglobin. In Parasponia, the hemoglobin 
gene is expressed at a high level in nodules and at a lower 
level in normal root tissues (Landsmann et a!., 1986). These 
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findings suggested to us that the Casuarina gene we had iso- 
lated was likely to be expressed in plant tissue other than 
nodules and that Casuarina must have another gene that is 
expressed in nodules. In Parasponia, one gene performs both 
of these roles. 

In this study, we report the isolation of hemoglobin genes 
from Casuarina. These genes encode hemoglobin found in 
Casuarina nodules. We show that this hemoglobin gene fam- 
ily has a pattern of expression different from that of the 
previously isolated Casuarina gene. The newly isolated sym- 
biotic gene family is expressed only in the nodule. Nodule- 
specific expression of leghemoglobin genes has been studied 
extensively in transgenic legumes. The promoter regions 
of the soybean ibc3 and the Sesbania rostrata glb3 genes 
both confer nodule-specific expression in Lotus corniculatus 
(Stougaard et al., 1986; Szabados et al., 1990). Promoter de- 
letion and fusion analyses identified a cis regulatory element 
important for high-level nodule-specific expression (Stougaard 
et al., 1987; Szabados et al., 1990). This element (OSE (organ- 
specific element] in the ibc3 promoter and NICE [nodule- 
infected cell expression] in the glb3 promoter) contains two 
motifs, 5-AAAGAT-3' and 5'-CTCTT-3\ present in most nodulin 
promoters (Sandal et al., 1987). Site-directed mutagenesis 
showed that the contribution of the 5-AAAGAT-3' motif to 
nodule-specific expression is of minor importance, whereas the 
5'-CTCTT-3' motif as well as the adjacent 5-TGG-3' are essential 
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ggttaggatttctctctcttttcaattccaaacattttatcaaaaattagtcaaccctaatac 554 

ttatagaaataagttaagtagatttagatattggcttgattgattctatttctcacatcaaaa 617 

taatcaaaattggggttgggtgcacttcctccttttctcttcttttcattttgttcacttcag 680 

ttttttccttatcaaatgcataagagtcatcttaaggcttgcatctcaactcctattcttttt 743 

cttcttcttcctaatoaagcaatttcgatccaagtacttcaatgtctaaagccatcccatggt 806 
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LGSIHLXNKITDPHFE 
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ctactaattgcctccatgaatctagGTC ATC AAA GGA GCC TTA CTA GCA ACA ATC 1205 
VMKGALLGTI 

Hb-SymA 

HK-SymB ----- 

Hb-SymC 

Hbl 

AAA GAC GCA ATT AAA GAG AAT TGG AGT GAT GAG ATG GGT TGT GCA TGG 1253 
KEATKEP/WSDEMGCAW 
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aetcacttacatgtatttatggtatcaacccatatcatgcaccttcacatacttcatgtttgt 1427 
gtgtataaagggtgtaatgcttacataggttctcctagtaggcaagtaaatttcccatctata 1490 
otaaataaaaccttttattggggtcgtaaccaaggtcctgttactcctaaatgtgtgacaaat 1553 
aa 

Figure 1. Sequence of the Casuarina Symbiotic Hemoglobin Gene 
(cashb~$yml ). 



for expression of the nodule-specific gene (Ramlov et al., 1993; 
Szczyglowski et al., 1994). 

We introduced both the Casuarina symbiotic and nonsym- 
biotic hemoglobin promoters linked to the p-glucuronidase 
(GUS) reporter gene into L corniculatus. The patterns of ex- 
pression observed in Casuarina for both genes were retained 
in the transgenic legume. This suggests that in the two plant 
families not only do similar molecular mechanisms control the 
activity of the nodule-specific genes, but also common con- 
trols must exist for the plant tissue-expressed gene. This raises 
the possibility that legumes have a second family of hemoglo- 
bin genes that are expressed in plant tissues. These genes 
are yet to be described. 

The Qcomoler of the Casuarina symbiotic gene contains 
nodulin motifs similar to those found in leghemoglobin genes. 
Deletion analysis showed that the region containing the nodulin 
motifs was essential for expression of the reporter gene con- 
struct in transgenic nodules. 



RESULTS 



Casuarina Has Both Symbiotic and Nonsymbiotic 
Hemoglobin Genes 

Hemoglobin cDNA clones were isolated from a Casuarina nod- 
ule expression library in screens with a polyclonal antibody 
raised against Casuarina nodule hemoglobin (Fleming et ai. ( 
1987). The sequence of one of the clones (Hb-SymA; Figure 
1) has a deduced amino acid sequence almost identical to the 
protein sequence reported by Kortt et al. (1988) for purified 
nodule hemoglobin. Two other clones (Hb-SymB and Hb- 
SymC) had sequences not completely identical to Hb-SymA 
but showed at least 97% identity in the coding region (Figure 
1). These could be other members of a gene family or they 
could be allelic forms of the one gene. The "gene family" alter- 
native is supported by DNA gel blot analyses in which an 
Hb-SymA probe revealed a complex pattern of bands (Figures 
2A and 2B). BamHI digests produced nine hybridizing bands, 
four strong bands (2.2, 4, 6.5, and 9.5 kb), and five weakly 
hybridizing bands (3.9, 8, 12, 15, and 19 kb). We determined 
that these bands represent alleles of a smaller number of 
genes rather than nine different genes by analyzing ONA from 
individual seedlings (Figure 2B). Four of nine individual seed- 
lings (1, 2; 5, and 9) displayed a banding pattern identical to 
that of the pooled population, giving support to the multiple 
gene alternative. The remaining five plants had only slightly 
different hybridization patterns, suggesting some sequence 
polymorphism in the population. Only one of the strongly 

The deduced amino acid sequences of three cDNAs (Hb-SymA, Hb- 
SymB, and Hb-SymC) are compared with the amino acid sequence 
of the nodule hemoglobin Hbl (Fleming et al., 1987). Lowercase let- 
ters indicate the noncoding sequence. Dashes indicate identity with 
the amino acid encoded by cashb-syml. 
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legumes that also have three or four symbiotic leghemoglobin 
genes. 

Previously, we had isolated a hemoglobin gene from C giauca 
using a Parasponia hemoglobin gene as a probe (Christensen 
et al., 1991). This Casuarina gene showed a low level of de- 
duced amino acid similarity with the nodule protein. DNA gel 
blot analysis with this gene revealed a simple genomic pattern, 
with most restriction enzymes generating only one strongly 
hybridizing band and one weakly hybridizing band (Figure 2C). 
During the isolation of this gene, we had detected, in addition 
to the full-length gene, a fragment containing an incomplete 
gene sequence (Christensen et al., 1991); this fragment probably 
corresponds to the weaker band (Figure 2C). We concluded 
that C. giauca has only one functional nonsymbiotic hemo- 
globin gene and, probably, a truncated nonfunctional gene 
segment. 

We isolated 30 clones from a genomic library of 240,000 
plaque-forming units by using Hb-SymC as a probe. Seven 
of the clones mapped into three classes (Figure 3). Classes 
1 and 2 contained only a single gene, whereas class 3 clones 
(1,18, and 21) each contained two hemoglobin genes in oppo- 
site polarity, 7 kb apart. 

The sequence of one of the genes, cashthsyml (genomic 
clone 38) (Figure 1), has a deduced amino acid sequence iden- 
tical to that of Hb-SymB, implying that it is transcribed in 
nodules. The gene has three introns in exactly the same posi- 
tions as the introns of ali known plant hemoglobin genes 
(Jensen et al., 1981; Landsmann et al., 1986), including the 
Casuarina nonsymbiotic hemoglobin gene (Christensen et al., 
1991). 



Class 1 
clone 38 
clone 24 
clone 6 



Figure 2, DNA Get Blot Analysis of C. gtauca Genomic DNA. 

(A) Restriction digest hybridized to Hb-SymA cDNA is shown. 

(B) BamHI digests of nine individual seedlings (labeled above the gei) 
probed with Hb-SymA cDNA are shown. 

(C) Restriction digests hybridized to a fragment of the cashb-nonsym 
gene are shown. 

Molecular length markers are given at left in kilobases. Tot, total popu- 
lation of plants. 



hybridizing bands showed length polymorphism; in plant 8, 
the 6.5-kb band was missing but a new, strong band was seen 
at 11 to 12 kb. The four strongly hybridizing bands, which must 
represent sequences with high homology to the cDNA probe, 
are therefore likely to be from a set of genes present in each 
of the plants. The less intense bands could represent inactive 
pseudogenes with lower homology. Thus, Casuarina resembles 
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Figure 3. Diagram of Organization of Genomic Hemoglobin Segments 
Cloned from Casuarina. 

Seven clones were characterized, and these fall into three classes. 
Class 3 has two copies of the gene on each clone. The various probes 
used to map the genes are shown. B, E, and H are BamHI, EcoRI, 
and Hindi M restriction enzyme sites, respectively. 



216 The Plant Cell 



The Symbiotic and Nonsymbiotic Genes Have Different 
Expression Patterns 

RNA gel blot hybridization using the Hb-SymA probe on RNA 
isolated from a number of Casuarina tissues showed that the 
symbiotic genes are expressed at a high level in nodules. No 
expression was detectable in root, leaf, or stem tissue (Figure 
4A). The nonsymbiotic gene was expressed in root, leaf, and 
stem tissues and only at a barely detectable level in nodules 
(Figure 4B). The level of expression in the nodule was ~100- 
fold less than that of the symbiotic genes. The hemoglobin 
produced by the symbiotic gene must play a role in the Frankia 
symbiosis; this role is probably comparable to the oxygen trans- 
port role of leghemoglobin in rhizobia symbioses with legumes. 
In contrast, the nonsymbiotic hemoglobin protein is unlikely 
to be involved in nitrogen fixation, but rather to be involved 
in other metabolic processes in a variety of plant tissues. 



Casuarina Hemoglobin Genes Maintain Their 
Expression Patterns in Transgenic 
L. corniculatus 

The promoters of members of the two gene families were fused 
to a GUS reporter gene cassette and introduced into L cor- 
niculatus, a legume species (Stougaard et al., 1986). The 
constructs involving the promoters of two cashb-sym genes 
showed a high level of expression in the central, infected cells 
of the nodule (Figure 5A). The level of GUS activity generated 
by the promoter of the Casuarina symbiotic gene in the infected 
cells of the nodule is similar to that seen with the soybean 
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Figure 4. The Expression Pattern of Casuarina Hemoglobin Genes. 

(A) Gel blot analysis of RNA from various Casuarins tissues using a 
cashb-sym gene probe. 

(B) A cashb-nonsym gene was used to probe the same blot shown in (A), 
Equal amounts of total RNA, as determined at OD 2 6onm. from nodules, 
roots, and leaves/stems were hybridized. 28S and 18S, rRNAs. 



Ibc3 leghemoglobin promoter in transgenic L corniculatus 
(Lauridsen et al.» 1993). By using dark-field microscopy, a tech- 
nique that increases the sensitivity of detection of GUS reporter 
gene activity (Medberry et al., 1992), we were able to detect 
low-level expression in the cortical cells lying just outside the 
central infected zone (see pink cells, Figure 5B), but there was 
no expression detectable in the other cells of the cortex or in 
any of the cells of the vascular traces. Dark-field microscopy 
also showed low-level expression in some of the interstitial 
uninfected cells in the central zone of the nodule (Figure 5B). 
In addition to expression in the nodules of the transgenic 
roots, the cashb-sym promoter was active in the root cap cells 
of roots that had been cultured without exposure to Rhizobium 
(Figure 5H). 

The promoter of the nonsymbiotic gene also directed activ- 
ity of the GUS reporter gene in nodules, but only at a level 
detectable by the sensitive dark-fieJd technique (Figure 5D) 
and not in bright-field view (Figure 5C). Expression occurred 
in the nodule parenchyma cells between the endodermis and 
the infected central celts and in the vascular bundles of the 
nodule (Figure 5D). This pattern overlaps the expression pat- 
tern of the symbiotic genes, which are also expressed in ceils 
immediately surrounding the central zone. Dark-field analysis 
showed that the nonsymbiotic gene was also expressed in the 
uninfected but not the infected cells of the central zone (Fig- 
ure 5D). The blue color of the infected cells in Figure 5D is 
not due to GUS staining. The gene was active in the 
meristematic zone of root tips (Figure 5G) and in the paren- 
chyma internal to the endodermis and associated with the 
vascular stele of the root (cf. Figures 5E and 5F). 



The cashb-sym Promoters Have Nodulin Motifs 

in legumes, the promoters of hemoglobin genes and other nodu- 
lin genes contain two motifs, 5'-AAAGAT-3' and 5'-CTCTT-3', that 
are 6 bp apart in most cases (Figure 6). The sequences are 
critical for the nodule-specific expression of the genes 
(Stougaard et al,, 1987; Ramlov et al., 1993; Szczyglowski et 
al., 1994). The promoters of the cashb-sym genes also con- 
tain these same two nodulin motifs in similar relative positions 
(Figures 6 and 7). This suggests that these motifs have the 
same function in the hemoglobin genes of the Casuarina- 
Frankia symbiosis as they do in genes of the legume-rhizobia 
symbiosis. The regions surrounding the Casuarina nodulin 
boxes are AT rich and show some homology with the com- 
parable regions of the legume promoters (Figure 6). There is 
a second copy of the A A AG AT motif farther downstream in 
both cashb-sym promoters (Figure 7). 

We examined the functional importance of the nodulin mo- 
tifs in the Casuarina genes by 5' deletion analysis (Figure 8). 
Removal of the nodulin motif region (-252-bp deletion) dis- 
abled the cashb-syml promoter (Figure 8A). In this particular 
gene, there is a duplication of the segment including the nodulin 
motifs (-478 to -462 cf. -311 to -295; Figure 7); deletion 
of the proximal motifs (A344/186; Figures 8A and 8B) showed 
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that the remaining upstream motif sequence was not sufficient 
to promote gene activity in the nodule. If the distal motifs were 
disrupted (-474; Figure 8A), expression levels were reduced 
but were still detectable and the expression pattern was un- 
changed. The cashb-sym2 gene has only one copy of each 
of the two motifs, although it has levels of expression com- 
parable to those of the cashb-syrM gene (Figures 8A and 8B). 
The deletion analysis also showed that other upstream regions 
were important for high-level expression (Figure 8B). The 
removal of these regions did not alter specificity of expression 
but did alter the level of expression. 

The cashb-nonsym promoter contains sequences related 
but not identical to nodulin motifs (Figure 6), and the spacing 
(4 bp) between the motif sequences differs from that in the 
symbiotic genes (Figure 6). A 5' deletion analysis showed that 
the promoter region containing the motifs does not control gene 
activity of this nonsymbiotic gene (Figures 8C and 8D). Se- 
quences upstream of -1000 are important for both root and 
nodule expression. 



DISCUSSION 

Nitrogen-fixing symbioses between plants and bacteria occur 
in a number of dicotyledonous families. In a number of 
phylogenetic schemes, these nitrogen-fixing families do not 
seem to be closely related (Landsmann et al., 1986). The best 
characterized symbiosis is that between legumes and rhizo- 
bia. Only one non-legume symbiosis involving rhizobia occurs; 
this is in the genus Parasponia (Ulmaceae, the elm family). 
In other nitrogen-fixing families of plants, such as the 
Casuarinaceae and Myricaceae, the symbiotic relationship is 
with the Gram-positive actinomycete Frankia. The differing 
modes of invasion by the symbiotic organisms and the differ- 
ing morphologies of the nodules formed in the symbioses 
have suggested to a number of investigators that the rhizobia- 
and FranWa-based symbioses have evolved independently 
(Appleby et al., 1988; Schwintzer and Tjepkema, 1990). 

In both types of symbioses, the biochemistry is similar. Nitro- 
gen fixed by the microsymbiont is exchanged for photosynthate 
produced by the host plant. Both symbiotic systems employ 
hemoglobin to supply oxygen needed for microbial respira- 
tion at free oxygen concentrations sufficiently low to protect 
the oxygen-sensitive bacterial nitrogenase enzyme systems 
(Appleby, 1984). 

Initially, when plant hemoglobin was known only from le- 
gumes, it was thought that the gene may have been introduced 
into plants from the animal kingdom by some lateral evolution- 
ary mechanism (Appleby, 1974; Jeffreys, 1981; Hyldig-Nielsen 
et al., 1982). Although the amino acid sequences of plant and 
animal hemoglobins have limited sequence identity (13 to 15%), 
the key functional residues in the molecules are conserved, 
and the two introns occurring in most animal hemoglobins are 
in precisely the same positions as two of the three introns found 
in all plant hemoglobins (Jensen et al., 1981; Landsmann et 



al., 1986; Christensen et al., 1991). The position of the central 
third intron in plants had been predicted by Go (1981) on the 
basis of structural analysis of the animal kingdom hemoglo- 
bin molecule. These data and the fact that hemoglobin genes 
are known in several unrelated families of plants make the 
lateral gene transfer hypothesis unlikely. Instead, our obser- 
vations favor a single evolutionary origin of the hemoglobin 
genes of both the animal and plant kingdoms. 

In our current studies, we have isolated the symbiotic he- 
moglobin genes of Casuarina and shown that the members 
of this gene family bear striking similarities with leghemoglobin 
genes. They have the same major features of gene organiza- 
tion and the same nodule-specific expression, which is 
restricted to the central bacteroid-containing region; they also 
have nodulin sequence motifs in their promoters. In both le- 
gumes and Casuarina, the nodulin motif region is essential 
for gene activity in nodules. 

We have also found that when the Casuarina symbiotic genes 
are introduced into a legume (L corniculatus), they have a high 
level of expression in the bacteroid-infected zone of the nod- 
ule, indicating that the Casuarina promoter motifs are 
recognized by the transcriptional signal systems of the leg- 
ume, despite the different symbiotic microorganisms of the two 
systems. The expression in the infected cells may be depen- 
dent upon a signal produced by the invading bacteria, whether 
they are rhizobia or Frankia. De Billy et al. (1991) suggested 
the presence of such signals in legumes, and Welters et al. 
(1993) have identified a bacterial DNA binding protein inter- 
acting with the S. rostrata leghemoglobin promoter. We are not 
able to explain the significance of our observation of some 
gene activity in the root cap zone in noninfected transgenic 
roots (Figure 5H), although it could be a consequence of the 
A. rhizogenes transformation system; the transgenic hairy roots 
are known to have an altered hormone regime compared with 
normal roots (Schmulling et al., 1988). 

The leghemoglobin and the Casuarina symbiotic genes con- 
tain nodulin motifs at comparable relative positions in their 
promoters. Similarly spaced motifs have also been recorded 
in other nodulin genes with expression restricted to nodules 
(Sandal et al., 1987; Stougaard et al., 1990). The nonsymbi- 
otic hemoglobin gene of Casuarina does not have nodulin 
motifs closely matching the consensus motifs nor is their spac- 
ing comparable to that of the motifs of the symbiotic genes 
(Figure 6). This gene does not have a nodule-specific expres- 
sion pattern. It is active in a number of plant tissues and in 
the nodule, where it is expressed only in the vascular and inner 
cortical regions. The structure of the nonsymbiotic gene of 
Casuarina is similar to that of leghemoglobin genes in that it 
has the same intron positions, but its deduced amino acid se- 
quence more closely resembles that of the hemoglobin gene 
of the non-legume Parasponia, with 80% sequence identity 
and with an N-terminal extension to the protein sequence 
(Christensen et al., 1991) (Table 1), as is found in the Paraspo- 
nia gene. 

From existing data (Wittenberg et al., 1974; Fleming et al., 
1987; Gibson et al., 1989), it is reasonable to assume that the 





Figure 5. Expression of Chimeric Casuarina Hemoglobin Genes in L cornicufatus. 



Hemoglobin Genes in Casuarina 219 



cgTTGT|tTCTT 
TCTT 
TCTT 



-141 AAGTTTTTGA^AAGAT 
-133 AAATTTTTTAAAAGAT 
-128 AGGATTTTGAfcAAGAT - cATTGg 
-129 AAGATTTTGAAAAGATcatTTGg 
-200 AAGTTTGGCAAAAGAT-T-TTGT 
-202 AAATTTTTAAAAtGAT - TATTGT 
-202 AAATTTTTAAAAAGAT- TATTGT 
-149 AAGTTTTTGAAAAGtT-TtTTGT 
-152 AAGTTTTTTAAAtGAT- TATTGT 
-206 AAGTTTTTGAAAAGAT- TATTGT 
-201 AAGTTTTTTAAAtGAT -TATTGT 



-322 ACTTCAATCCcAAGATgTccTcT 
-489 ACTTCAATCCcAAGATcTccTcT 
-327 ACTTCAATCCcAAGATgTccTtT 



- 207 - - AATTGCCClfcAAGAala - - -TGg :TtTc 
•109 TAAAAAACCC AAAGATa - - -TGg :TCcc 
-117 AAAAAAACCC AAgGAga — TGgpTCTc 



gATTGT 



:tctttgattatgtt 
tcttaataatgtca 



:tctt 



TCTT 



:accatacca 
ttcatcatg 
gtcatgccg 
:atcatgccg 



\ATAATGTCA 



:tctt*ataactaca 
:tctttaataatgtc 
:tctttattgttgtc 
itctttaataacgtc 

tctuttgatattt 
:tctuttgatattt 
:tctt\ttgatattc 



-104 
-96 
-91 
-91 
-164 
-165 
-165 
-112 
-115 
-169 
-164 

-284 
-451 
-289 



---GACCCAC -177 
- -CAATACC - 77 
— CAGTACC - 85 



lbc3 Glycine max a 

Iba Glycine max 3 

lbcl Glycine max 3 

Ibc? Glycine max a 

lb Pi sum sativum* 3 

glbZ Sesbania rostrata c 

gjb3 Sesbania rostrata 0 

Ibl Medicago truncatula^ 

IbZ Medicago truncatula 6 

Msl Medicago sativa e 

plb2 Medicago sativa e 

hb-syml Casuarina glauca 
hb-syml Casuarina glauca 
hb-sym2 Casuarina glauca 

hb-non-sym Casuarina glauca 1 
hb Parasponia andersoni i 1 
hb Trema tomentosa* 



Figure 6. Alignment of Promoter Sequences from Different Symbiotic and Nonsymbiotic Hemoglobin Genes. 

The nodulin motif sequences are boxed. Uppercase letters between and in the boxed sequences indicate identity with the consensus sequence 
determined for nodule genes. Numbers indicate the distance from the transcription initiation site, except for the Casuarina sequences, in which 
numbers indicate the distance from the translation initiator ATS. References indicated by superscript letters are as follows: a, Stougaard et ai. 
(1987); b, Nap (1988); c, Metz et al. (1988); d, Gallusci et al. (1991); e, Davidowitz et al. (1991); f, Christensen et al. (1991). The cashb-sym sequences 
are from this study. 



hemoglobins in the nodules of legumes, Casuarina, and 
Parasponia all function as oxygen carriers in the symbiotic in- 
teraction. The nonsymbiotic hemoglobin in Casuarina also has 
all the sequence and structural characteristics of an oxygen 
carrier. The fact that its expression is restricted to plant tis- 
sues where oxygen supply is likely to be limiting adds support 
to this possibility. Recently, it has been reported that a hemo- 
globin gene exists in barley and that it is induced by hypoxic 
conditions (Taylor et al., 1994), supporting an oxygen trans- 
port role. We had earlier suggested another possible function 
for hemoglobins in plants as oxygen sensors rather than oxy- 
gen carriers (Appleby et al., 1988). The oxygen sensor (Fix 
L protein) of R. meiiloti has been shown to be a hemoprotein 
with kinase activity (Gilles-Gonzales et al., 1991). Until we have 
mutant plants available, we will not be certain of the function 
of plant hemoglobins in nonsymbiotic tissues. 



Evolution of Hemoglobin Genes In Plants 

The sequence and structural identities of Casuarina nonsym- 
biotic hemoglobin and Parasponia hemoglobin suggest a direct 
evolutionary lineage for these genes. Previously, we had sug- 
gested that it was likely that all plants have a hemoglobin gene 
and predicted that in addition to the symbiotic leghemoglobins 
known in legumes, there may be another gene or gene family 
in legumes encoding hemoglobins operative in nonsymbiotic 
plant tissues (Appleby et al., 1988; Bogusz et al., 1988). This 
concept has been given additional support by the findings of 
hemoglobin genes in barley and maize (Taylor et al., 1994). 

In Casuarina, the nonsymbiotic hemoglobin gene could have 
given rise to the specialized symbiotic hemoglobin gene fam- 
ily by gene duplication and subsequent sequence divergence. 
In Parasponia, the requirement for high-level expression in 



Figure 5. (continued). 

Tissue was stained, and thin sections were cut and examined by bright- or dark-field microscopy. Expression of the cashb*syml promoter-GUS 
fusion is shown in (A), (B), and (H). 

(A) Bright-field micrograph of a nodule cross-section showing GUS activity (blue) in the symbiont-infected cells. 

(B) Dark-field micrograph of the same section shown in (A). A high level of GUS activity is shown in the infected cells (blue), and a low level 
of GUS activity (pink) is visible in uninfected interstitial cells of the central zone and in the inner cortical cells immediately surrounding the central zone. 

(C) to (G) Expression of the cashb-nonsym promoter-GUS fusion. (C) is a bright-field micrograph of a nodule cross-section showing no visible 
GUS activity. (D) is a dark-field micrograph of the same section shown in (C). GUS activity (pink) is now visible in the central zone cells between 
the infected cells and in the inner cortex of the nodule parenchyma. The blue appearance of the bacteroids is not the result of GUS expression. 
A higher level of activity may be present in the nodule vascular tissues. (E) shows a cross-section through a root with a lateral root (bright-field). 
(F) is a dark-field micrograph of the same section shown in (E). GUS activity (pink) occurs in the stelar parenchyma of the main root and lateral 
root; a higher level of activity may be present in the vascular ceils. Expression in the meristematic zone of a root tip is shown in (G). 

(H) Expression of cashb-syml in the root cap celts of a root not infected with Rhizobium, 
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TTATAAAAAT GCAATAATCC CTCUCGTTG CTTCTTCTAA CTTGCTTAAA AAAATATTAA 60 

CATCAATCAA AATTAACTGC TTCAACCACT TCAACATTXSA CTACTAAAA- TCCAAATGTC 119 
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CTTTAAACAA ATGACTACGA ACACTTAACT TAGATCAAAC ATAACATCCT AATCACTTTA 119 
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ATTTGAACAA CAACAAAGAT AAACAACCAT TATCCCTACC AAOCAAGTAA CTTGTAGAAA 299 
AAAAGAAAAA AGAAAAAAGC AAAAACCATC CTTTAAAAAC CCAAAACCCA CCa|cAATTCC 359 



AACCC-TTTG ATCAACTTCA ATCCCAA£AI_ CTXXTTCT^^QJTATTGATAT _^GAAC^ACA 418 
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GAAATCTAGC TCAACCTTTA TTTATTGCAT GTCTTTCTCA ATTTCTAATT TTCTTCCC1& 595 



TATATA GATT GGTGTTTTTA CATATGTTAT TAGTGAGAAG TAGAAGCATA AGCAAAAGCT 655 



AACAAAGTGA CTTTGTGAGC TTGTGAGAGA --GAGACAAA GAAATG 69 9 
**C AA**«*T 

Figure 7. Sequence of the Promoter of Casuarina Symbiotic Hemo- 
globin Genes. 

The sequence of the cashb-syrM promoter is aligned with that of the 
cashb-sym2 promoter An asterisk indicates an identical base; dashes 
were introduced to optimize alignment; dots indicate a deletion. The 
158-bp duplication of cashb-syrM is indicated by boxes. The TATA box 
with nodulin motifs is underlined. The proposed CACCCT box is dou- 
ble underlined. 



symbiotic nodules and lower expression in nonsymbiotic tissue 
has apparently been achieved with a single gene with differen- 
tial expression in nodules and nonsymbiotic tissues of the plant. 

Symbiotic hemoglobin genes could have arisen indepen- 
dently in rhizobia and Frankia symbioses, or alternatively, a 
gene duplication event could have happened before the diver- 
gence of the legume and Casuarina plant families. The 
hypothesis of independent origins of the symbiotic genes in 
the two families has some support in the observation that 
Casuarina symbiotic hemoglobin shows more sequence 
similarity with Casuarina nonsymbiotic hemoglobin than with 
soybean symbiotic hemoglobins (53 versus 44% identity; Table 
1). However, two lines of evidence favor a common origin of 
the symbiotic genes in legumes and Casuarina, First, the tran- 
scription factors in the L corniculatus nodule are compatible 
with the Casuarina promoter sequences, and comparable 
nodulin motifs are critical in both leghemoglobin and in the 
cashb-sym genes. Second, both symbiotic genes lack the 
N-terminal extension found in all known nonsymbiotic genes. 



The recruitment of preexisting genes for nodule-specific 
function by gene duplication is likely to have occurred with 
genes other than the hemoglobin gene. Glutamine synthase, 
glutamate synthase, the peribacteroid membrane proteins 
N-23, N-24, and N-26, and early nodulins all show differently 
regulated genes with either nodule-specific or non-nodule- 
specific expression patterns (Nap and Bisseling, 1990; Miao 
and Verma, 1993). This contrasts to the situation in Parasponia, 
in which a single gene is expressed differently in nodule and 
host plant tissues. It may be that Parasponia, the legumes, 
and Casuarina share a common ancestor that was involved 
in the early evolution of symbiotic nitrogen fixation and that 
subsequently there have been at least two different strategies 
of specialization for genes involved in nodule function in fami- 
lies of flowering plants. 

METHODS 



RNA/DNA Isolation Method from Casuarina 

RNA was prepared from Casuarina glauca nodules by a modification 
of the method of Hughes and Galau (1988). Plant tissue (5 to 10 g) 
was ground to a fine powder in liquid nitrogen and sprinkled into 55 
mL of cold buffer (one-tenth volume of TE3D [200 mM Tris-HCI t pH 
85, 300 mM LiCI, 10 mM Na 2 EDTA, 1.5% lithium dodecylsulfate, 1% 
w/v sodium deoxycholate, 1% v/v Nonidet P-40], 5% w/v insoluble PVR 
90 mM mercaptoethanol, 10 mM DTT, 0.1% DEPC) and stirred to en- 
sure immediate contact with the buffer. After stirring for 5 to 10 min, 
46 mL of 3 M ammonium acetate was added, and the extract was spun 
at 5000g for 20 min. RNA/DNA was precipitated from the supernatant 
with one-tenth volume of 3 M sodium acetate and one-half final vol- 
ume of isopropanol and was spun at SOOOg for 30 min. The pellet was 
resuspended in 5 to 10 mL of H 2 0 and purified by phenol-chloroform 
extraction; this process was repeated until the preparation looked clean. 
The preparation was finished with a chloroform extraction. RNA was 
precipitated with one-quarter volume of 10 M LiCI on ice for 2 to 12 
hr, followed by a spin at lO.OOOg for 30 min. DNA was recovered from 
the supernatant by ethanol precipitation. The RNA pellet was resus- 
pended, ethanol precipitated, and washed in 70% ethanol. 



Isolation of cDNAs 

Poly(A) + RNA was fractionated from nodule RNA, A nodule cDNA li- 
brary in XZAPII (Stratagene) was constructed from 5 ug of poly(A) + 
RNA (cDNA synthesis kit; Pharmacia). One hundred and twenty plaque- 
forming units were screened using antibodies raised against Casua- 
rina hemoglobin isolated from nodules (Fleming et al., 1987). Three 
positive clones were isolated and sequenced using Applied Biosystems 
(Foster City, CA) automatic sequencing system and standard sequenc- 
ing procedures. The EMBL accession numbers for Hb-SymA, Hb-SymB, 
and Hb-SymC are X77694, X77695, and X77696, respectively. 



DNA Gel Blot Analysis 

Genomic DNA was isolated from Casuarina seedlings essentially as 
described by Hughes and Galau (1988) and digested with enzymes 
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Figure 8. Deletion Analysis of Casuarina Hemoglobin Promoters. 

(A) Diagrams are shown of the 5' regions of cashb-syml and cashb-sym2 promoter-GUS constructs together with the 5' and internal deletions 
of cashb-syml that were transferred into Lotus plants. 

(B) GUS activity of cashb-syml in nodules measured by fluorometric assay is given as nanomoles of 4-methylumbeiiiferone per milligram of 
protein per hour (vertical axis). Promoter constructs are identified on the horizontal axis. 

(C) The 1700-bp cashb-nonsym promoter-Gl/S construct (5' region) together with the 5' deletions that were introduced into Lotus are shown. 

(D) GUS activity in nodules (filled columns) and roots (open columns) were measured by fluorometric assay and are given as nanomoles of 
4-methylumbelliferone per milligram of protein per hour (vertical axis). Deletion end points are given as nucleotides from the translation initiator 
ATG. The 95% confidence intervals are indicated above the columns. 



as indicated. The digests were electrophoresed on a 0B% agarose 
gel. Blotting and hybridization were according to Sambrook et al. (1989). 
The symbiotic hemoglobin probe was the Hb-SymA cDNA insert, and 
the nonsymbiotic hemoglobin probe was a 900-bp Hindi 1 1 fragment 
extending from the third exon downstream in the cashb-nonsym gene 
(Christensen et al., 1991). 



RNA Gel Blot Analysis 

Total RNA was isolated from Casuarina nodules, leaves/stems, and 
roots of uninfected plants by a modification of the method of Hughes 
and Galau (1988). Equal amounts of RNA, as determined by OD^nm. 



Table 1 . Amino Acid Homology between Plant Hemoglobins 




Casuarina 


Casuarina 


Parasponia 


Plant 
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hb-sym 


hb 


Hemoglobin 


(%) 
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(%) 


Soybean lba b 


43 


44 


40 


Parasponia hb c 


80 


52 




Casuarina hb-sym d 


53 






* Christensen et al. (1991). 








b Hyldig-Nielsen et al. (1982). 






c Blfolk (1972); Landsmann et al. (1986). 






d Kortt et al. (1988). 









222 The Plant Cell 



of each sample were subjected to electrophoresis on a 15% formal- 
dehyde agarose gel, blotted, and hybridized. ^P-labeled RNA probes 
were the Hb-SymA cDNA and a 600-bp genomic Hpal-Hindlll fragment 
containing the third exon of the cashb-nonsym gene (Christensen 
et af., 1991). 



Isolation of Genomic Clones 

A genomic library in XEMBL4 was made from 15- to 20-kb size- 
fractionated DIMA of a partial Sau3A digest of Casuarina DN A (Sambrook 
et al., 1989). Approximately 240,000 plaque-forming units were screened 
with a ^P-labeled Hb-SymA cDNA probe. Eight genomic clones were 
isolated and characterized by restriction mapping. A fragment con- 
taining the complete coding region of cashb-syml and a 1.45-kb 
promoter region (EMBL accession number L28826) was subcloned 
into pUC118 and pUCH9 and sequenced using the Applied Biosystems 
automatic sequencing system and standard sequencing procedures. 
A 2.3-kb genomic fragment containing the promoter of a second gene, 
cashb-sym2 (EMBL accession number X77693), was subcloned, and 
450 bp of the proximal promoter was sequenced. 

Construction of Chimeric ^-Glucuronidase Genes 

The 1.45-kb fragment containing the cashb-syml promoter and the 
23-kb fragment containing the cashb-sym2 promoter were cloned into 
the pALTER vector. A Bglll restriction site was generated by mutation 
at the translation initiator ATG using the Promega in vitro mutagene- 
sis kit. The mutated cashb-syml promoter was sequenced to ensure 
that the mutation was present. The mutated promoters were cloned 
using the Hindlll-Bglll sites into Hindlll-BamHI of the plV20 vector 
(Hansen et al., 1989). The 5' deletion series was generated using the 
double-stranded nested deletion kit from Pharmacia, and end points 
were determined by sequencing before cloning the promoter deletions 
as Bglll end-filled EcoRI fragments into the BamHI end-filled Sail site 
of the plV20 vector. The cashb-nonsym promoter used is ~1.7 kb long 
and was cloned as an Xbal-Bglll fragment into the Xbal-BamHI site 
of the vector plV20. Using the following restriction sites present in the 
promoter, 5' promoter deletions were generated: Bglll, Spel, and Xhol 
sites that were ligated to BamHI, Xbal, and Sail sites of the plV20 
vector, respectively. 



Plant Transformation 

Gene constructs were transferred into Agrobacterium rhizogenes as 
described by Van Haute et al. (1983). The AR12 strain carrying the 
cauliflower mosaic virus 35S-chloramphenicol acetyltransferase gene 
construct in the left T-DNA segment (Hansen et al., 1989) was used 
as the vector throughout this study. Lotus corniculatus was transformed, 
regenerated, and nodulated as described previously (Stougaard et al., 
1986, 1987; Petit et al.. 1987). 



Biochemical Assays 

Activity from the (5-glucuronidase gene was measured in seven to 12 
plants of each construct by fluorometric assay (Jefferson et al., 1987). 
Protein levels in extracts were determined by the dye binding assay 
of Specter (1978). Chloramphenicol acetyltransferase assays for 



transformation were performed as described previously (Stougaard 
et al., 1986). 
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